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Abstract. We show that provided log 50 n/n < p < 1 — n _1,/4 log 9 n we can with 
| high probability find a collection of [5(G) /2J edge-disjoint Hamilton cycles in G ~ 

. G n ,p, plus an additional edge-disjoint matching of size [n/2\ if 8(G) is odd. This 

, ^ is clearly optimal and confirms, for the above range of p, a conjecture of Frieze and 

Krivelevich. 

o 

CN ■ 1. Introduction 



o 



The question of how many edge-disjoint Hamilton cycles one can pack into a given 
graph 67 has a long history, but many of the main questions remain open. More 
U; precisely, we say that a graph G has property % if G contains [5(G) /2\ edge-disjoint 

[ Hamilton cycles, together with an additional edge-disjoint optimal matching if 5(G) is 

odd. (For a graph G on n vertices, call a matching M in G optimal if \M\ = [n/2\.) 
An old construction by Walecki (see e.g. [TJ [26]) shows that the complete graph K n 
has a Hamilton decomposition if n is odd (here a Hamilton decomposition of a graph 
G is a set of edge-disjoint Hamilton cycles which together cover all of the edges of G). 
It is well known that more generally K n has property % (see e.g. [32J). We discuss 
CN ' some further results at the end of the introduction. 

The question was first investigated in a probabilistic setting by Bollobas and Frieze in 
^-j. \ the 1980's (see below). Later, Frieze and Krivelevich [13] made the striking conjecture 

that whp Gn )P has property H for any p = p(n). Here G n ^ p denotes a binomial random 
graph on n vertices where every edge is present with probability p, and we say that 
a property of a random graph on n vertices holds whp if the probability that it holds 
tends to 1 as n tends to infinity. 

Conjecture 1 (Frieze and Krivelevich [13]). For anyp = p(n), whp a binomial random 

^ . graph G ~ G n ^ p has property %. 

H ■ 

Our main result confirms this conjecture as long as p is not too small and not too 
large. 

Theorem 2. Let log n n < p < 1 — n -1 / 4 log 9 n. Then whp G n . p has property %. 

We deduce Theorem [2] from a purely deterministic result (Theorem I48p which states 
that every graph which satisfies certain pseudorandomness conditions and which is 
close (but not too close) to being regular has property %. Our proof shows that if 
both 5(Gn tP ) and n are odd, then the optimal matching guaranteed by property % can 
be chosen to cover the vertex x of minimum degree. So the Hamilton cycles and this 
matching together cover all the edges at x. 

We now discuss some related results on packing edge-disjoint Hamilton cycles in 
random graphs. Most of these actually consider the slightly weaker property Us of 
containing [5(G) /2\ edge-disjoint Hamilton cycles. Let G„ jm denote a random graph 
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chosen uniformly from the set of graphs on n vertices with m edges. Bollobas and 
Frieze [7J showed that an analogue of Conjecture [1] holds in the model G n ^ m when 
2m/n = pn < log n + 0(loglog n). In this range of m, whp 5(G n ^ m ) is bounded. Their 
result generalizes a result of Bollobas [5], which gave a 'hitting time' result for the 
property that G n ^ m contains a Hamilton cycle. Frieze and Krivelevich [13] extended 
the range of p and showed that property Tis holds for all p with pn = (1 + o(l)) log n. 
Recently, this was further extended by Ben-Shimon, Krivelevich and Sudakov [4] to all 
p < 1.02 log n/n. In this range whp G n>p is far from being regular. In fact, as noted 
in [3] whp we have 5{G n ^ p ) < np/300, and so only a small fraction of the edges of 
G UtP are contained in the Hamilton cycles guaranteed by property Tis- Using different 
ideas than in the present paper, very recently Krivelevich and Samotij [21] were able 
to cover the range log n/n < p < n~ 1+£ . Finally, the 'very dense' case was settled 
by Kiihn and Osthus [25j . who covered the range when p > 2/3. So altogether, these 
results cover the entire range of probabilities p and show that whp Tis holds for any p. 
We emphasize that the 'main' range of probabilities is covered by [21] and the current 
paper. 

As soon as pn 3> logn, whp we have S(G ntP ) = (1 + o(l))np. So in this range, 
Gn :P is close to being regular and so the above results yield an 'approximate' Hamilton 
decomposition of G njP , i.e. a set of edge-disjoint Hamilton cycles covering almost all 
edges of G n ^ p . Such an approximate result for constant p was first proved by Frieze 
and Krivelevich [12] , In [20J, we extended this approximate result to all pn 3> logn. 
This was also proved independently by Krivelevich (personal communication). 

As mentioned earlier, not many graphs are known which have property Ti. Some 
examples are discussed in the surveys [HE] - mainly these deal with classes of graphs 
which have a high degree of symmetry. A more recent result in the area is due to 
Alspach, Bryant and Dyer [2], who showed that Paley graphs have property Ti. Kim 
and Wormald [19] proved that whp property Ti holds for random regular graphs of 
fixed degree. Nash- Williams [27] conjectured that any d-regular graph on n vertices 
where d > n/2 has [d/2\ edge-disjoint Hamilton cycles. (So such graphs would have 
property Ti if d is even.) An approximate version of this conjecture was proved by 
Christofides, Kiihn and Osthus [9]. 

A very general result in this direction was obtained very recently by Kiihn and 
Osthus |24j . who showed that every regular 'robustly expanding' digraph of linear 
degree has a decomposition into edge-disjoint Hamilton cycles. The initial motivation 
for this result was that it implies that every (large) regular tournament has a Hamilton 
decomposition, which proved a long-standing conjecture of Kelly. However, as observed 
in [25], the main result of |24j has a number of further applications. For instance, it can 
be used to show an analogue of Conjecture Q] for random tournaments, which confirms 
a conjecture of Erdos. It can also be used to improve the above result in [9]. Similarly, 
the result in [25J which confirms property Tis for G ntP in the 'very dense' case is also 
derived from the main result of |24j . 

Hypergraph versions of Conjecture Q] were also recently considered by Frieze and 
Krivelevich [14] . Frieze, Krivelevich and Loh [15] and Bal and Frieze [3]. 

A related line of research was initiated by Glebow, Krivelevich and Szabo [16J, who 
investigated the 'dual' problem of finding the smallest set of Hamilton cycles which 
together cover all edges of G n)P . As is whp not regular, this means that these 
Hamilton cycles will not be edge-disjoint and the best result one can hope for is a 
set of [~A(G raiP )/2] such Hamilton cycles. The main result in [16] shows that this 
bound is approximately true if p > n~ 1+e . Subsequently, Hefetz, Kiihn, Lapinskas 
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and Osthus [T7] achieved an exact version of this result for a similar range of p as in 
Theorem [2j 

Theorem 3. Let log n n <p<l- n^ 1 / 8 . Then whp the edges of G n ^p can be covered 
by [A(Gr)/2] Hamilton cycles. 

For p as above, this proves a conjecture of |16j . The crucial ingredient in the proof 
of Theorem [3] is the main technical result of the current paper (Lemma [4T]) . Roughly 
speaking, given a graph H satisfying rather weak pseudorandomness conditions and a 
pseudorandom graph G\, which is allowed to be surprisingly sparse compared to H, 
Lemma |4"T1 guarantees a set of edge-disjoint Hamilton cycles covering all edges of H. 

Our proof of Theorem [2] exploits the fact that in the given range of p, there is a small 
but significant gap between the two smallest degrees of G np . This is not necessarily 
the case when npj logn —> oo very slowly (see e.g. [6]). This is one of the reasons 
for our restriction on the range of p in Theorem [2j Our method is based on a new 
technique of iterative improvements which is likely to have further applications. In 
particular, this idea was an essential feature of the proof in [24J. 

2. Sketch of proof and organisation of the paper 

Recall that in [20] we proved an approximate version of Conjecture [H which finds 
a set of edge-disjoint Hamilton cycles covering all but an r\ proportion of the edges of 
G nyP . Roughly speaking, the proof of this approximate result proceeds as follows: 

(1) First we choose < e <C 77 and remove a random subgraph Gthin of density ep 
from G n , p , and call the remaining graph G dense- Gthin wm be a pseudorandom 
graph, which is important in Step 4 below. 

(2) Next we apply Tutte's theorem to find an r-regular subgraph G reg of Grf ense , 
with r = (1 — e)np. 

(3) Using a counting argument, one can show that most 2-factors of G reg have 
few cycles. We remove such 2-factors one by one to obtain a set of (1 — e)r/2 
edge-disjoint 2-factors Fi which cover most edges of G reg (and thus of G n>p ). 

(4) We now use the edges of G t Mn to transform each F, into a Hamilton cycle. 
More precisely, for each i in turn, we swap some edges between G t hin and Fj in 
such a way that the modified 2-factor is in fact a Hamilton cycle. For the next 
2-factor, we use the 'current' version of Gthin- The fact that Fi has few cycles 
means that we do not need to swap out many edges of Gthin f° r this, and so 
Gthin retains the pseudorandomness properties which allow us to perform this 
step. 

Our approach to Theorem [2] uses the above ideas, but some obvious obstacles arise. 
For simplicity, assume that n and 5(G ntP ) are both even in what follows. We can use 
Tutte's theorem to show that G„ iP has an r-regular subgraph G reg with r = 5{G n ^ p ) 
(this is harder than Step 2 but not too difficult for our range of p). So we can decompose 
G reg into r/2 2-factors Fi and would like to transform each Fi into a Hamilton cycle. 
At first sight, this looks infeasible for 2 reasons: 

(a) The counting argument in Step 3 only works for reasonably dense subgraphs 
of G n>p . So the 2-factors produced by the later iterations in Step 3 will have 
too many cycles. 

(b) We no longer have a graph Gthin to use in Step 4. 

For (a), it turns out that one can extend the counting argument so that it also works for 
sparser subgraphs of G n>p , provided these subgraphs are pseudorandom (see Lemmas [241 
and [26]). 
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A very useful observation which goes some way in solving problem (b) is the follow- 
ing: Let xq be the vertex of minimum degree in G n , P - Then whp there is a small but 
significant gap between d(xo) and d(x) for any x ^ xq in G nyP (the gap has size close 
to ^/np). Let Gi e f t be the subgraph of G„ iP consisting of all the edges not in G reg . 
Then Gi e ft has density about y/p/n, and we could try to use Gi e f t to merge 2-factors 
instead of the graph Gthin in Step 4. One problem here is that G\ e ft is not actually 
pseudorandom as it is just the 'leftover' from the Tutte application. 

We use the following idea to overcome this problem. Right at the start, we remove 
the edges of a random subgraph G5 of density p$ from G n>p , where nps <C y/rvp. Let 
xq be the vertex of minimum degree of the remaining subgraph G\ of G njP . The choice 
of £>5 implies that xq will also be the vertex of minimum degree in G njP . Now add all 
edges of G5 which are incident to xo to G\. Then 8{G\) = 5(G n , P ) and it turns out 
that we can still apply Tutte's theorem to obtain an r-regular subgraph G reg of G\ 
(see Lemma [22]) . Again we obtain a decomposition of G reg into 2-factors Fi . 

The key advantage of this method is that G5 — xq will be pseudorandom, and so one 
can use G§ in the same way as Gthin to merge the Fi into Hamilton cycles. However, it 
turns out that G5 is far too sparse to complete the process: if e.g. p is a constant, then 
our bound on the total number of cycles in the Fi is significantly larger than n 3//2 . On 
the other hand, G5 has significantly fewer than n 3 / 2 edges. To reduce the number of 
cycles in Fi by 1, we need about logn edges from G5. So even if we return the same 
number of edges from Fi to G5 each time, we cannot assume G5 to be pseudorandom 
after dealing with even a small proportion of the Fi. 

Our main idea is to use an iterative approach to overcome this. Choose P2, P3 and 
P4 so that P5 <C Pa <C P3 <C P2 "C p- Then choose randomly edge-disjoint subgraphs 
Gi of Gn iP , of density pi, for i = 2,3,4,5. (For simplicity, assume that G2, G3 and 
G4 are regular of even degree in what follows.) Let G\ be the remaining subgraph of 
G n , P (with all of the edges of G5 which are incident to xq added to G\) and consider 
5{G\)/2 edge-disjoint 2-factors of Fi of G\. Then as our first iteration we transform 
all the Fi into Hamilton cycles, where G2 plays the role of Gthin- In the second 
iteration, we do the same for (the leftover of) G2, with G3 playing the role of Gthin, 
and so on until we reach G5. (Note that there is no need to do anything with the 
leftover of G5, as G5 contains no edges incident to xq.) In total, this gives us a set of 
L(5(Gi) + <5(G 2 )+5(G 3 ) + 5(G 4 ))/2j = [d{x Q )/2\ edge-disjoint Hamilton cycles which 
together contain all of the edges incident to xq. 

The new problem that we now face is that e.g. in the second iteration the graph G2 
is a 'leftover' from the first iteration. So its pseudorandom properties are not as strong 
as we need them to be (e.g. in order to apply the strengthened version of the counting 
argument in Step 3) due to the existence of some 'bad edges' which were moved into 
G2 from the Fi during the first iteration. 

In order to deal with this, we perform some intermediate steps (about r = log nj log log n 
per iteration) using yet more random graphs Gr 2 j) (which we must also remove from 
G n , p initially). So first G^,i) plays the role of Gthin when transforming the 2-factors 
of G\ into Hamilton cycles. We then transform (the leftover of) GWi) into Hamilton 
cycles using Gf 2 ,2) and G( 2i3 ), and then the leftover of G( 2j2 ) an d GW3) using G( 2 ,4) 
and G(2,5), etc. Roughly speaking, after r iterations we will have replaced G^,i) by a 
graph G( 2) 2r+l) °f almost the same density as G( 2j i), but containing no bad edges (see 
Lemma I4T|) . So it is now possible to carry out the second iteration (as described in 
the previous paragraphs) with Gy 2j 2t+1) i n place of G 2 and G( 31 ) in place of G3, and 
similarly the third and further iterations. 
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Thus Lemma [37] can be regarded as one of the key statements of the paper, and we 
believe it is of independent interest: as indicated above, it states that given a regular 
graph Hq (which satisfies some fairly weak quasirandomness conditions) and a graph 
H which is the union of many quasirandom graphs G^j), we can find a set of edge- 
disjoint Hamilton cycles in Hq U H covering all edges of Hq, even if H is allowed to be 
much sparser than Hq. 

This paper is organized as follows: In Section HI we define the pseudorandomness 
properties we need in the proof and show that with very high probability they are 
satisfied by G niP . In Section [5] we use Tutte's theorem find an analogue of G reg . In 
Section [6] we use our extended counting argument to split this analogue of G reg into 
2-factors. In Section [7] we show how to merge the cycles in each of these 2-factors into 
a Hamilton cycle. Finally in Section [8] we combine the results from the rest of the 
paper to prove Theorem [2j using the iterative approach discussed above. 

3. Notation 

Throughout the paper we use the following notation: for a graph G and sets A, B 
of vertices of G, we write ec(A, B) for the number of edges of G with one endpoint 
in A and the other in B. Let ec{A) = ec(A, A). For a graph G, let e(G) denote 
the number of edges of G, and for a spanning subgraph H of G, let G\H denote the 
graph obtained by removing the edges of H from G. Ng(A) is always taken to be the 
external neighbourhood of A, i.e., Nq(A) = (\J xeA N(x))\A. log denotes the natural 
logarithm, and we write log a n for (logn) a . 

Since we are aiming to prove a result with high probability, we may and do assume 
throughout the paper that n is always sufficiently large for our estimates to hold. 
Further we omit floor and ceiling symbols, and assume large quantities to be integers, 
whenever this does not have a significant effect on the argument. 

4. Pseudorandom graphs 

Our aim in this section is to establish several properties of random graphs which we 
will use later on, mainly regarding the degree sequence and expansion of small sets. 
For many of these, the fact that they hold whp in G n<p is well known; however, we need 
them to hold in log n / log log n random spanning subgraphs of G n)P simultaneously. To 
accomplish this we first show that they hold in G n ^ p with probability 1 — 0(1/ log n), 
and then take a union bound. 

The following useful definition is due to Thomason [29]. It involves tighter bounds 
than the similar and more common notion of e-regularity. We will rely on these in the 
proof of Theorem [2l 

Definition 4. Letp,(3 > with p < 1. A graph G is (p, /3)-jumbled if\ec{S)—p{^ \ < 
/3s for every S C V(G) with \S\ = s. 

We will often use the following immediate consequence of Definition Let G be a 
(p, /3)-jumbled graph and let S, T C V(G) be disjoint. Then 

(i) |e G (5,r)- P |5||r||<2^(|5| + |r|). 

To see this, note that ec(5, T) = ec(SUT) — ec(S) — ec{T); now applying Definition 
[Hand using the triangle inequality implies (pQ). 

The following two definitions formalise the notion of 'degree gap', which we need in 
our proof. 
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Definition 5. Let G be a graph on n vertices with a vertex xq of minimum degree, 
and let u < n. Then G is u-jumping if every vertex of G apart from xq has degree at 
least 5(G) + u. 

Definition 6. Let G be a graph on n vertices. For a set T C V(G), let da(T) 
be the average degree of the vertices of T in G. Then G is strongly 2-jumping if 
d G (T) > 5(G) + min{|T| - 1, log 2 n} for every T C V(G). 

Note that if G is strongly 2-jumping then it is also 2-jumping. In addition to these 
three properties we will use several other bounds concerning the degree sequence and 
edge distribution of a random graph. The following definition collects these properties 
together. 

Definition 7. Call a graph G on n vertices p-pseudorandom if all of the following 
hold: 

(a) G is (p, 2 y/ np(l — p)) -jumbled. 

(b) For any disjoint sets S,TC V(G) with \S\ = s and \T\ = t, 



(c) np — 2^/np\ogn < 5(G) < np — 200y / np(l — p) and A(G) < np + 2\Jnp\ogn. 

(d) G is strongly 2-jumping. 

Definition [7{a) gives good bounds on the densities of large subgraphs but not on 
those of very small subgraphs, which is why we need (b). 

The remainder of this section will be mainly devoted to showing that the random 
graphs we consider in the rest of the paper are in fact pseudorandom (see Lemma [T5|) . 
For (a), (b) and (c) we will prove slightly stronger bounds, as these will be used in [T7] . 
We will need the following large deviation bounds on the binomial distribution, proved 
in |18j (as Theorem 2.1, Corollary 2.3 and Corollary 2.4 respectively): 

Lemma 8. Let X ~ Bin(n,p). Then the following properties hold: 

(i) lfh>0, then P [X < np - h] < e~ h2 / 2np . 

(ii) Ife< 3/2, then F[\X - np\ > enp] < 2e" e2np / 3 . 

(hi) If c > 1 and d = log c — 1 + 1/c, then for any a > cnp, ¥[X > a] < e~ c a . 

The following lemma implies that the property in Definition [T^a) holds with very 
high probability in G n:P for the desired range of p. 

Lemma 9. Suppose that np/ log 2 n — > oo and n(l—p)/ log 2 n — > oo, and let G ~ G U:P . 
Then the probability that G is not (p, |ynp(l — p))- jumbled is at most 2/n 2 . 

Proof. Without loss of generality we may assume that < p < 1/2 (by considering 
the complement of G if p > 1/2). For each set S C V(G) with \S\ = s, we have that 




e(S) ~ Bin((*),p). Let 




and note that epN = |-y/np(l — p)s. Call S bad if \ec(S) — pN\ > epN. We consider 
the following cases: 
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Case 1: s > 2y p ^ + 1. Then e < 3/2, and hence by Lemma E^ii) we have that 
the probability of S being bad is at most 2e~ £2 P N / 3 . But since p < 1/2, we have 

3ns(l— p] 
2(s-l)' 



£ 2 piV/3 = ^ffef 1 > ¥■ So the probability that 5 is bad is at most e" 3n / 4 . 



Case 2: s < 2y — + 1. Since e > 3/2 in this case, we have that epN > pN and 

hence F[e G (S) < pN - epN] = 0. Let c = e + 1, so c > 5/2. Let c' = log c - 1 + 1/c 
and note that since d is an increasing function of c for c > 1, we have d > log(5/2) — 
1 + 2/5 > 1/4. Now by Lemma[5[iii) applied with a = (1 + e)piV, the probability that 
5 is bad is at most e ~ c '( 1 + 6 )P N < e" c ' V™^ 1 "^ < e -4siogn = n -4s_ 

We now take a union bound on the probability that there exists a bad set S. Firstly 

for any s < 2^J n ^ p ^ - + 1, the probability that there is some bad set S with \S\ = s 

is at most ( n )n _4s < n _3s < n~ 3 . Summing over all such s, we have an error bound 
of n 1-3 = 1/n 2 . Further the probability that there exists a bad set S with \S\ = s > 



Ini^pl + 1 ig at mogt 2 „ e - 3 n/4 < ^2 

the proof. □ 



2y p + 1 is at most 2 n e 3n / 4 < 1/n 2 . Now adding these two bounds completes 



To check the properties in Definition 0(b) we can use the following two simple 
lemmas, which are slight strengthenings of Lemmas 5 and 6 respectively in |20j . 



Lemma 10. Let G ~ G np . Then with probability at least 1 — 1/n 2 , the following 
properties hold for any disjoint A,BQ [re], with \A\ = a, \B\ = b: 

CO V {1 + 1)^^1 thene G (A,B)<l(a + b)logn. 
( u ) V (5 + I) ^ < 5> then e G (A,B) < 6abp. 

Proof. Let X := e(A,5). So X ~ Bm(|i4||J3|,p). Write t := + |5|)logre. First 

1 , J_ \ logn ^ 7 T i OT . ^ 21 



suppose that (^pj + ygy J > |- Then t > — and so by Lemma Etui) 

applied with c = we have 

(2) P (X > t) < e-( lo ^-^) ( < e-T = n-I^I+I^D. 

If \A\ + |5| < 20 then X < \A\\B\ < t. Otherwise, © implies that 

1 



F(X >t)<n 



-4 



n |A|„[B[' 

and so the result follows by a union bound. 

Now suppose ^pq- + i^y^ -^p < |, so that t < ^-\A\\B\p. Then, from Lemma[8^iii) 
applied with c = 6 and a = 6\A\\B\p, we obtain 

F(X > Q\A\\B\p) < e -(log6-|)6|A||Bb < e -%\A\\B\p < e - tj 
and so the result holds as before. □ 

The proof of the next result is almost the same as that of Lemma [TOl and so we 
omit it. 

Lemma 11. Let G ~ G(n,p). Then with probability at least 1 — 1/n 2 , the following 
properties hold for every A C V(G) with \A\ = a: 

(i) If^>\, then e G (A) < falogn. 

(ii) If^<l thene G (A)<3a*p. 
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It remains to establish the bounds in Definition E|c) on the minimum and maximum 
degree of G UtP (see Lemma [T3|) . and the fact that G n ^ p is strongly 2-jumping with 
probability 1 — 0(1/ log n) (see Lemma fT4j) . For this we need estimates on the binomial 
distribution which do not follow from standard Chernoff bounds (see Lemma [T2|) . 
These estimates use the following notation, where X ~ Bin(n — l,p): 

• b(r) = F [X = r] = ( n ; 1 )p r (l - p) n - r -\ 

• B(mi,m 2 ) = IP [mi < X < 7112], and 

• B(m) =F[X <m\. 

b'(r), B'(mi, 777,2) and B'{m) are defined similarly for X ~ Bin{n — 2,p). 

Lemma 12. Suppose that np(l — p) —> 00. Let mi = np — 2 a/ np log n, 777-2 = np — 
200a/ 77,73(1 — p), 777,3 = np — ^-y/nplog n and A = 1 — 1/(8 log 3 n). Then 

(i) 77,5(771,2) > 776(7772) > \/n/logn ; and 

(ii) 775(777,1) < 775(7773) < l/y/n. 

Suppose in addition that p > 48 2 log 7 n/77 and 1 — p > SGn^ 1 / 2 log 7 / 2 77. Then 



(iii) 



iv 



(v) 



b(r-l) 
B(mi —8 log a n,r) 



> A /or eac/i r > 7771 — 8 log n, 



b'(r) 

b(r) 



> 4 log 77 for each r > 7771, and 



< 1 + 1/ log n for each r > mi . 



Proof, (i) Let X ~ Bin(n — l,p), a = yj(n — l)p(l — p) and m 2 = np— 201a/ np(l — p). 
Now the de Moivre-Laplace Theorem (see e.g., Theorem 1.6 of [6]) states that if a — )■ 00 
and X2 > xi are constants, then 

B((n - l)p + xia, (n - l)p + x 2 a) = (1 + o(l))(</>(x 2 ) - 4>(xi)), 

where <j)(x) is the cumulative density function of the normal distribution. Clearly 



B{np + xiy / np(l — 77), np + X2 -\/np(l — p)) 
= (1 + o(l))5((n - l)p + xia, (n - l)p + x 2 a). 

To see this, note that the boundaries of the interval change by at most 2 and that 
b(r) — > for any r. Hence we have that 



where c 
So 



B(m' 2 ,m 2 ) = (1 + o(l))c, 
-200) — </>(— 201) is constant. Clearly b{r) < b(m 2 ) for each m' 2 <r< m 2 . 

[1 + o(l))cn 



nb(m 2 ) > 



> 



n 



v / np(l -p) + 1 log n 
(ii) Let X ~ Bin(n — l,p). By Lemma[8ji) we have that 



X < np — 15a/ np log n/8 < P X < (n — l)p — 7 a/ np log n/4 



< e 



-(7\Mpl°g ra/4) 2 /2np 



< e 2 



log 71 



77 



3/2 ' 



(iii) We have 



b{r-l) (r^i)p r - 1 (l-p) n ~ r (n- l)!r!(n-r- 1)!(1 - p) r(l-p) 



6(r) 



/n-l 



.)p r (l -p) n " 



r-1 



(n — l)!(r — l)!(n — r)!p (n — r)p' 
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and so 



b(r — 1) np — r ^ 2^/np log n + 8 log n 



6(r) (n-r)p (n - + 2 Vnp log n + 8 log 3 n)p 



3\/ np log n 3-v/log n 



np — np 2 (1 — p)y/rvp 

Now if p > 1/2 then (1 — p)y / np > (36n _1 / 2 log 7//2 n)(y / n/2) and the result follows. 
On the other hand if p < 1/2 then (1 — p)y/rvp > (48 log 7 / 2 n)/2 and again the result 



follows. 

(iv) Note that 



B(m\ - 8 log 3 n,r) > b(r - 1) b(r — 2) 6(r-81og 3 n) 
6(r) ~~ b(r) b(r) b(r) 



> l + A + A 2 + ... + A 81og3n 

1 - e" 1 

> — o— > 41og 3 n, 

" 1/(8 log 3 n) ~ 



Y _ ^log 13 n+1 

l^A 



where in the final line we use that A < e VC 81 ^ 3 "). 
(v) 

b'(r) _ ( n ;V(l - p) n - r - 2 (n - 2)!r!(n - r - 1)! 



b{r) ( n ~ 1 )p r (l-p) n - r ~ 1 (n- l)!r!(n- r- 2)!(1 - p) 

n — r — 1 np — p — r ^ _^ 2^/np\ogn 



(n - 1)(1 - p) (n - 1)(1 - p) - (n - 1)(1 - p) 

1 



— p) log n ' 

as desired. □ 



Lemma 13. Let G ~ G n>p . Suppose that np(l — p) — ?■ oo. TTien 

(i) P [5(G) < np — Ig-^nplogn < 1/y/n, and 

(ii) P [A(G) > np+ ^V^logn] < 1/v/n. 

Suppose further that p > 48 2 log 7 n/n and 1 — p > ?>§n~ 1 l' 1 log 7//2 n. Lei mi = np — 
2\J np log n, m,2 = np — 200-y/ np(l — p) and /ei mi < m < ni2 be such that nB(m) > 
logn and nB(m — 1) < logn. TTien 



(iii) P 
fiv) P 



(5(G) > m] < 4/ logn, and 



(5(G) > np - 200y / np(l - p) < 4/ logn 



Note that the results in Chapter 3 of [6] give sharper bounds which hold with 
high probability. However as mentioned earlier, we need our error probability to be 
smaller. Since this does seem to affect the precise results we need to prove the bounds 
explicitly. Also note that by Lemma [T2Ti) and (ii), nB{m\) < logn < nB(m2) (with 
room to spare), and so there exists m\ < m < m-i such that nB(m) > logn, but 
nB(m — 1) < logn. Thus (iii) is not vacuous. 

Proof, (i) By taking a union bound we have that 



(5(G) < np — 15^/ np log n/8 < nB(np — 15y np logn/8) < l/y/n, 
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where the last inequality follows from Lemma [T27ii) . 

(ii) Apply (i) to the complement of G. 

(iii) Let Y be the number of vertices v E V(G) such that mi < d(v) < m. As 
nB{m\) < by Lemma [T27ii) . we have 

E(Y) = nB(mx,m) > nB(m) — nB(mi) > logn — \j\fn > 21ogn/3 

and 

E 2 (Y) = n(n - l)(pB'(mi - 1, m - l) 2 + (1 - p)B'(m 1 ,m) 2 ) < n 2 B'( mi ,m) 2 . 
Hence 

v^w) < e^^w _ Er =mi ^(of? j_ 

where the last inequality follows from Lemma fT2Tv). Hence 

Var(Y) =E 2 (Y) + E(Y) - E(Y) 2 < (1 + 1/ log n) 2 E(Y) 2 + E(F) - E(y) 2 

= (2/logn + l/log 2 n)E(Y) 2 + E(y). 

So by Chebyshev's inequality, 

, Var(Y) 2 114 
P(y = 0) < — ^ < + + — — < 



E(Y) 2 " logn log 2 n E(F) "logn' 

(iv) This follows immediately from (iii) and the remark after the lemma statement. 

□ 



Lemma 14. Let G ~ G nyP . Suppose that p > 48 2 log 7 n/n and 1—p > 36n l l 2 log 7 ^ 2 n. 
Then with probability at least 1 — 6/ logn, G is strongly 2-jumping. 

Similarly to Lemma [131 note that Theorem 3.15 in [6] would imply Lemma [T4"l if we 
only required the statement to hold with high probability. 



Proof. Let mi = np — 2-y/nplogn, tti-2 = np — 200^ np(l — p) and A = 1 — 1/(8 log n). 
Let mi < m < m 2 be such that nB{m) > logn, but nB{m — 1) < logn. By Lemma 
fTBT iii) there exists a vertex of degree at most m with probability at least 1 — 4/ logn. 
So it suffices to show that with probability at least 1 — 2/ log n there are no two vertices 
each of degree at most m + 2 log 2 n whose degrees differ by at most 1. 

Let Z be the number of (unordered) pairs v\, v 2 of vertices such that mi < min{d(v i), d(v2)} < 
m + 2 log 2 n and \d{v\) — d(v2)\ < 1. We have 

m+2 log 2 n 



o) Pb'(r-1) 2 + 2pb'(r- l)b'(r) + (1- p)b'(r) 2 + 2(1- p)b'(r)b'(r + I) 

' T=m\ 

2 m+21og 2 n 2 , 1 \ 2 m + 21o § 2n 

< y £ 3 &W(r + l)< — (l+j— ) • E Kr)6(r + 1) 



r=mi x ' r=m\ 

< 2n 2 b(m + 2 log 2 n + l)B(mi,m + 2log 2 n), 

where the third inequality follows from Lemma [T2Tv). Note that 

(3) A2log2n+2>1 _2Jog 2 n + 2 > 6_ 

8log 3 n - 7 
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Now by Lemma [T27 iii) . 

nB{m u m + 21og 2 n + l)= ]T nb(r) < ^ lo / w+2 

r=mi r=mi 

nB(mi — 2 log 2 n — 2, m — 1) 



^2 log 2 n+2 

nB(m — 1) logn © 7 logn 

— ^2 log 2 n+2 — ^2 log 2 n+2 — 6 

So Lemma [T2T iv) implies that 

nB(m\ — 8 log 3 n, m + 2 log 2 n + 1) 



n6(m + 21og z n + 1) < 



< 



4 log 3 n 

nB(rti\ — 8 log 3 n, mi — 1) + nB(rti\,m + 2 log 2 n + 1) 

4 log 3 n 
8 log 3 n • nb(mi) + g log n 
4 log 3 n 



and now since b(mi) < B{mi), Lemma [T2T ii) implies that 

2 7 1 

nb(m + 2 log 2 n + 1) < — H ^— < ^— . 

J ~ y/n 241og 2 n - 31og 2 n 

Hence 

E(Z)<2-^. T77 V< 1 



6 3 log n log n 

So by Markov's inequality ¥[Z > 1] < 1/logn. Now by Lemma fl~37i) . the probability 
that there are any vertices at all of degree at most mi is at most 1/y/n. So with 
probability at least 1 — 1/ log n — 1/ y/n > 1 — 2/ log n, there are no pairs of vertices of 
degree at most m + 2 log 2 n, whose degrees differ by at most 1. □ 

We now combine the above results to prove that our desired pseudorandomness 
conditions hold whp in G UtP . 

Lemma 15. Let G ~ G n>p . Suppose thatp > 48 2 log 7 n/n and 1—p > 36n -1 / 2 log 7//2 n. 
Then the probability that G is not p -pseudorandom is at most 11/ logn. 

Proof. By Lemmas El EH CEH Efti), HSfiv) and [H the probability that G is not 

p-pseudorandom is at most 

2 1111 4 6 11 
+ + —f= + —f= + ; + ; < 



n 2 n 2 n 2 ' y/n ' y/n ' logn ' logn " logn 

□ 



Lemma 16. Let Gq ~ G rajP0 . Let Pi, ■ ■ ■ ,p§ be positive reals such that pi + p2 + P3 + 
P4+P5 = Po- Let m2, m-3, m^ < logn/ log logn be positive integers. For each i = 2,3, 4, 
let p (i)1) , . . . ,p^ 2mi +i) be positive reals such that p^ x) + p (i)2 ) + . . . + P(i,2 mi +i) = Pi- 
Suppose that 48 2 log 7 n/n < p < 1 — 36n~ 1//2 log 7//2 n whenever p = pi for < j < 5, 
P = / or i = 2, 3, 4 and 1 < j < 2m { + 1, or p = p { + /or i = 2, 3, 4. 

Define G\, G$ and Gu n, . . . , G(j 2m +i) / or * = 2,3, 4, as follows. For each edge ah 
<>f Go: 

• With probability pi/po let ab be an edge of G\. 
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• For i = 2, 3, 4 and 1 < j < 2rrii + 1, with probability puj-\/po let ab be an edge 
of G (iJ) . 

• Otherwise (i.e. with probability ps/po), let ab be an edge ofG^. 

For i = 2,3,4, let Gi = U^=i +1 Then whp, the following properties hold: 

(i) Gi is pi-pseudorandom for each < i < 5, 

(ii) Guj\ is P(ij)- pseudorandom for i = 2,3,4 and 1 < j < 2reij + 1, and 

(iii) Gi U Gj+i is (p» + Pi+i) -pseudorandom for i = 2, 3, 4. 

Proof. First note that ~ G niP(ij) for each i,j and that Gj ~ G n . Pi for each i. 

Further, d U G i+ i ~ ^.(pj+pt+x) for * = 2 ' 3 ' 4 - 

Hence by Lemma [15] for each graph, the probability that it is not p-pseudorandom 
for the relevant value of p is at most 11/ log n, and so by taking a union bound we 
have that the probability that at least one of these graphs is not pseudorandom is at 
most 67/ log log n, which tends to as n — > oo. □ 

At one point in the proof of Theorem [2] we will need G UtP to be u-jumping for some 
u > 2. For this we use the following result. 

Lemma 17. Let G ~ G niP with np/logn — > oo and n(l — p) / \ogn — > oo. Then with 
high probability, G is 8y^np(l — p)j log 3//4 n- jumping. 

We will apply this lemma only once (in Section [8]) , so in this case a whp estimate is 
sufficient. 

Proof. The conditions of the lemma immediately imply that np(l —p)j log n — > oo. Let 
m = min{(rap(l — p)/\og re) 1 / 5 , log 1 / 16 n} and let a = 81og _1 ^ 16 n; note that m — > oo 
and a —> 0. Applying Theorem 3.15 in [6] to the complement of G, we have that G is 
u-jumping where 

a ( np(l — p) \ 8A/rep(l — p) 

m 2 V logn J ~ log 3 / 4 n 

□ 

5. Constructing regular spanning subgraphs 

The first step in our general strategy for finding a large collection of Hamilton 
cycles in a p-pseudorandom graph G is to construct a regular spanning subgraph of G 
of degree 5(G) if 8(G) is even, or of degree 5(G) — 1 if 5(G) is odd. The aim of this 
section is to establish that this is always possible for our range of p. 

We use Tutte's r-factor theorem: Given a graph G, an integer r, and disjoint subsets 
S and T of V(G), let 

R r (S,T) =J2 d ( v ) ~ e G (S,T)+r(\S\ - \T\). 

Let Q r (S,T) be the number of components C of G— (SUT), such that r\C\ +ec(C, T) 
is odd. 

Theorem 18 (Tutte |31j). Let G be a graph and r be a positive integer. Then G 
contains an r-factor if and only if R r (S,T) > Q r (S,T) for every pair S,T of disjoint 
subsets of V(G). 

For the particular case of a 1-factor (that is, a perfect matching), we use the following 
simpler result: 
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Theorem 19 (Tutte [30]). Let G be a graph. Then G has a perfect matching if and 
only if for every S C V(G), the number of components of G — S which have an odd 
number of vertices is at most \S\. 

In order to make use of Theorem 1181 we first need to bound Q r (S,T). For this we 
use the following lemma: 

Lemma 20. Let G be an re / 1 n-pseudorandom graph on n vertices such that re > logn. 
Let ru > ft§ r G; and let H be a spanning subgraph of G such that 5(H) = r#. Let 
W C V(G), and suppose that 5(H[W]) > r#/3. Then for any nonempty B C W , the 
number of components of H[W\B] is at most \B\. Moreover, H\W] is connected. 

Proof. Suppose that a set B violates the first assertion. 

Claim: H[W\B] cannot have two disjoint isolated sets (i.e., unions of components of 
H[W\B]) each of size at least n/32. 

To prove the claim, it suffices to show that for disjoint sets S, T C W\B such that 
| /S' | , | T | > n/32, we have eu{S,T) > 0. To see that this holds, note that since G is 
(rc/n, 2- v /rY;)-jumbled by Definition [7J a), (P) implies that 

fc rr\ ^ r c\S\\T\ — r G n — 

e G (S, T) > Vtg(|5| + \T\) > - A^n. 

But since ru > ^r G , it follows that 

\E(G)\E(H)\ = e(G) - e(H) < ^ + 2^n - = ^ + 2^n, 

and hence en(S,T) > 0, which proves the claim. 

Let C be the set of components of H[W\B] of size less than n/32, and let A = (JC. 
Suppose that \A\ > n/8. Then we can form an isolated set S such that n/32 < < 
n/16 by taking successive unions of components in C. Similarly we can form another 
isolated set T with n/32 < \T\ < n/16 using the remaining components in C. But now 
S and T are disjoint, which contradicts the claim. Hence \A\ < n/8. Moreover the 
claim implies that iT[VF\-B] has exactly one component which is not in C. 

Now since by assumption, ff[VF\.B] has more than \B\ components, we have \A\ > 
\C\ > \B\. Now since A is isolated in H[W\B], it follows that the H [VK] -neighbourhood 
of A lies entirely in B. Hence every edge of i7[W] which is incident to some vertex of 
A lies in E H (A) U E H (A,B). It follows that J2veA N H[W]( V ) < 2e H (A) + e H (A,B), 
noting that edges in en (A) will be counted twice on the left-hand side. So 

r H \A\/3 < \A\6(H[W}) < 2e H {A) + e H (A, B) < 2e G {A) + e G (A, B) 

< r G \A\ 2 /n + 4v^g|A| + r G \A\\B\/n + A^{\A\ + \B\) 

< 2r G \A\ 2 /n + 12^\A\ < ( - + r G \A\, 

which is a contradiction unless |A| = 0. But if \A\ =0 then we have B = 0, which 
proves the assertion. The moreover part follows from the special case where \B\ = 1. 

□ 



Corollary 21. Let G be an r G /n-pseudorandom graph with r G > log 2 n and let G' be 
a graph obtained from G by first deleting an arbitrary matching M and then adding 
an arbitrary set of additional edges. Let r be an even integer. Then defining Q r (S,T) 
with respect to G' , any disjoint subsets S,T ofV(G) satisfy Q r (S,T) < \S\ + \T\. 
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Proof. By Definition GJc) we have that 5(G\M) > r G - 2^r G log n - 1 > §§§r G . Note 
that the number of components of G' — B is always at most that of (G\M) — B for 
any B C V(G). Now if S U T / then applying Lemma [20] with G = G, H = G\M, 
W = V(G) and B = S U T implies that H — (S U T) has at most \S U T\ components. 
It follows that Q r (S,T) < \S U T\ = \S\ + \T\. On the other hand, if S = T = 
then Lemma [20] implies that H (and thus G') is connected. Let C be the unique 
component of G' and note that r\C\ + e G /(C, T) = r\C\, which is even. So in this case 
Q r (S,T) = = \S\ + \T\. □ 

We are now ready to prove that every pseudorandom graph G has a regular spanning 
subgraph whose degree is equal to 2[6(G)/2\. In fact we will prove the following 
slightly stronger statement, which gives the same result even if G is modified slightly 
from being pseudorandom. 

Lemma 22. Let G be a p-pseudorandom graph on n vertices such that np(l—p) / log 2 n — )• 
oo. 

(i) For any vertex x G V(G), G contains an optimal matching which covers x. 

(ii) Let u < 4i/np(l — p), and suppose in addition that G is 2u-jumping. Let 
G' be formed from G by adding u edges at the vertex xo of minimum degree, 
and if 5(G) + u is odd, also removing an arbitrary matching M. Let r be the 
greatest even integer which is at most 5(G) + u. Then G' has an r -regular 
spanning subgraph. 

Note that if the matching we remove in (ii) covers xq, then r = do'(xo). The case 
of Lemma [22Tii) when u = is the only place in the proof of Theorem [2] where we use 
the fact that a pseudorandom graph is strongly 2-jumping (i.e., Definition E^d)). One 
can probably replace this by a weaker condition. On the other hand, Defintion E^a), 
(b) and (c) are probably not in themselves sufficient to prove Lemma [22] 

Proof, (i) If n is even then Theorem [19] and Lemma [20] together imply that G has a 
perfect matching, and the result follows immediately. If n is odd remove any vertex 
y 7^ x from G. Now by Lemma [20] the number of components of G — (S U {y}) is at 
most |5| + 1 for any S C V(G). But if the number of components is exactly | *S' | + 1, 
then at least one component must have an even number of vertices (otherwise we have 
n — \S\ — 1 = 151 + 1 mod 2 which cannot hold). Hence we can apply Theorem 1191 to 
G — y and the result follows. 

(ii) For disjoint sets S, T C V(G), let s = \S\ and p = |T|/|5|. Throughout the 
remainder of the proof let R r (S, T) and Q r (S, T) be defined with respect to the graph 
G'. By Corollary EH Q r (S,T) < \S\ + |T| = s(p + l) and so by Theorem[Tg]it suffices 
to show that R r (S,T) > s(p + 1) for all S and T. If T is nonempty, let dc(T) be 
the average degree, in G, of the vertices of T, and define dc(S), dc(T) and dc(S) 
similarly. Note that 

R r (S,T) = d G/ (T)\T\-e G ,(S,T) + rs(l-p). 

Note also that dc(xo) > r. We claim that dcix) > r + 2 for all x ^ xo. Indeed, if 
u = and 5(G) is even, we have 

d G ,( x ) = d G (x) > 5(G) + 2 = r + 2 

since G is 2-jumping by Definition [7]^d). If u = and 5(G) is odd, then similarly we 
have 

d G '(x) > d G (x) - 1 > 5(G) + 1 = r + 2. 
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If u > 2 and 5(G) + u is even, we have 

d G ,(x) = d G (x) > 5(G) + 2u > r + 2, 

and finally if u > 2 and (5(G) + it is odd, we have 

dcCa:) > - 1 > 5(G) + 2u - 1 > r + 2. 

This proves the claim. It follows immediately that unless T = {xq}, we have cIqi(T) > 
r + 1. Now we consider the following cases: 

Case 1: \S\ = 0. Observe that R r (S,T) = d G ,(T)\T\ - r\T\ and Q r (S,T) < \T\ by 
Corollary EIJ If either \T\ = or d G >(T) > r + 1 then trivially R r (S,T) > \T\ > 
Q r (S,T). So we are left with the case in which T = {xo} and d G /(xo) = r. 

In this case it suffices to prove that Q r (S,T) = 0. To see that this holds, note that 
applying LemmaEHlwith H = G\M, W = V(G) and B = {x } implies that G' - {x } 
is connected. But the unique component C of G' — {xq} satisfies r\C\ +ec(C, {xq}) = 
r(n — 1) + r = rn, and since r is even C cannot be odd. 

Case 2: \S\ > and p < 1/2. Since e G /(5,T) < d G ,(T)\T\ and \S\ > 2\T\, we have 

R r (S,T) > r(\S\ - \T\) > ^(\S\ + \T\) > \S\ + \T\ > Q r (S,T), 
where the last inequality follows by Corollary [2TJ 

Case 3: \S\ > 0, p > 1/2 and (± + ± \ ^ > |. In this case we have that e G /(5, T) < 

e G (S, T) + (p + l)s < 2(p + l)s log n + (p + l)s, where the last inequality follows from 
Definition [3(b). So 

(4) R r (S,T) -Q r (S,T)> ps(d G ,(T) - 21ogn - r - 2) + s(r - 21ogn - 2), 

and it suffices to prove that the right-hand side of is non-negative. Now observe 
that if p < r ^f gn+2 2 ' then this is immediate since d G >(T) > r. On the other hand, if 

P ^ r 2?Jgn+2 2 ' then l T l ^ r 2Lgn+2 2 ^ 3iHgH ^ 3 lo § n - So b y Definition (T^d) we have 
d G '(T) > r + 3 log n — 1 > r + 2 log n + 2, and the result follows. 

Case 4: \S\ > 0, p > 1/2, (7 + ^) ^ < 1 and < n/30. In this case Definition 

[71(b) implies that e G / (S 1 , T) < 7ps 2 p + u. So 

R r (S,T) - Q r (S,T) > ps(d G ,(T) - r - 1) + s(r - 7psp - w/a - 1), 

and it suffices to prove that the right-hand side of this inequality is non-negative. 
Recall that d G i(T) > r + 1 as \T\ = ps > 2(p + 1) log n/7p > 2, and so the first bracket 
is non-negative. Also 7psp < np/A < r/3 by Definition [TJc), and u/s < u < r/3. 
Hence the second bracket is positive. 

Case 5: |5| > 0, p > | and ps > n/30. Since G is (p, 2^np(\ — p))-jumbled, ([I]) 
implies that 

e G (S, T) < ps 2 p + Vnp(l-p)s(p + 1). 
Note that dc(T)|T| — e G /(S,T) > d G (T)\T\ — e G (S,T) — u. Now we claim that 
d G (T)\T\ -u>(np- 130y/np(l - p))\T\. Indeed, 

d G (T)\T\ = 2e G (T) + e G (T, V(G)\T) 

> ^('2') +Pl T K n ~ l T D ~^np(l-p)\T\ - 4y/np(l - p)n 

> np\T\ — p\T\ — (An + 4|T|) \Jnp(\ — p) > (np — 130a/ np(l — p))|T| + u. 
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Hence 

R r (S, T)/s> p(np — 130V 7 np(l — p)) — psp — 4ynp(l —p)(p + 1) + r(l — p) 
> p(np — r — 142 np(l — p)) + r — psp. 

But 

5(G) - 1 < r < 5(G) +u<np- 200^np(l-p) + A^Jnp(l-p) 

= np — 196^ np(l — p). 

Hence 

R r (S, T)/s > 5Apy/np(l - p) + 5(G) - 1 - psp > 50p^/np(l-p) + 5(G) - psp. 
Further as (p + l)s < n, we have 

— ZjTT ) = — — — 2y / raplogn. 

Nowifp^ i^/^then^ > 3J > 2y/np log n and so R r (S, T) /s > 50py/np(l-p) > 
p + l. 

On the other hand if p > gv/jj^ > y^T^p t nen %P\/npU^-p) > 2^/nplogn and 
so R r (S,T)/s > 48^^71^(1 - p) + + 1) > p + l. □ 

We remark that as long as rep/ log 6 re — )• oo and re(l — p) / log 6 re — )• oo (i.e., under less 
restrictive assumptions than those of Theorem [2|), Lemma [221 implies that whp G ntP has 
a 2[(5(G nj p)/2j-regular spanning subgraph. The fact that G ntP is whp p-pseudorandom 
for this range of p follows by Lemmas [9l llll Lemma [T7] and Corollary 3.13 of [6]. 

6. Splitting into 2-factors 

Our aim in this section is show that under certain conditions, an even-regular graph 
H can be decomposed into 2-factors so that the sum of the number of cycles in these 
2-factors is not too large. We would like such a result to hold for arbitrary even-regular 
graphs H, but this does not seem feasible for the densities that we consider. So we 
first show that it suffices for H to be a spanning subgraph of a pseudorandom graph 
G (see Corollary [27]) . and then that it suffices for H to be 'partly' pseudorandom 
(see Corollary [3T|) . More precisely, we show that for any even-regular graph H' of 
degree rjj', the union of H' with an even-regular graph H which is close to being 
p-pseudorandom may be decomposed into 2-factors with few cycles in total, provided 
only that p is somewhat larger than rjji /n. 

In order to bound the number of cycles in each 2-factor we will transfer the problem 
to a bipartite setting. This will allow us to use the following result (which was conjec- 
tured by van der Waerden) , which gives a good bound on the total number of perfect 
matchings in a regular bipartite graph. 

Theorem 23 (Egorychev [10], Falikman [H]). Let r < n be positive integers and let 
B be an r -regular bipartite graph with vertex classes of size n. Then the number of 
perfect matchings of B is at least (^) n n\. 

Given a bipartite graph B on vertex classes V = {v%, . . . , v n } and V' = {v[, . . . , v' n }, 
and a matching M in fl, let D(M) be the digraph on [n] formed by including an edge 
from a to b if and only if there is an edge of M between v a and v' b . Note that if P is 
a perfect matching then D(P) is 1-regular. For an integer C, let T J c(B) be the set of 
perfect matchings P of B such that D(P) has at least C cycles. Let Vk,i(B) be the set 
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of perfect matchings P such that D{P) has at least k cycles of length I. We now use a 
counting argument to show that T > G {B) is small whenever C is large and B satisfies a 
very weak pseudorandomness condition. The proof builds on ideas from [12] and later 
developments in [23] and [20J. 



Lemma 24. Let B be an rs-regular bipartite graph on vertex classes V = {y\, . . . , v n } 
and V' = {v[, . . . , v' n }. Let log 2 n < rQ < n and y^r^logn < re < r G /2, and let 
C = 2n\Jr G logn/rs- Suppose that all S QV and T C V such that \S\, \T\ > n/ \/r G 
satisfy e B (S,T) < ^(IWI^ Let x = (£a)n„|. Then \T C (B)\ < X. 

A similar result was proved in [12] and later in [20] . In the proof of Lemma [25] we 
use the following result, which is part of the proof of Lemma 2 in |12] (see inequality 
(5) there). 

Lemma 25 (Frieze and Krivelevich, p2]). Let B be an r-regular bipartite graph on 
vertex classes V and V' of size n. Let m < n and let S C V and TCI/' each have 
size n — m. Let B' be the bipartite subgraph of B induced by S and T , and let Y m be 
the number of perfect matchings of B' . Suppose that es{S,T) > rn/2. Then 

y m < ( ~ 2m ) r + eB ( V \ S ' y/ \ T ) ^ ^ e -n+mr 3n yOn/r 
m ~ \ n — m J 

Note that Lemma [251 is stated in [12] with an estimate rm? for eB(V\S, V'\T), rather 
than explicitly including the term e B (V\S,V'\T). Since the graphs we consider are 
subgraphs of random graphs we will have cb{V\S, V'\T) <C m 2 . This will allow us to 
obtain nontrivial bounds on \Pc{B)\ even for very sparse graphs. 

Proof of Lemma \24\ Let k = n\ogn/r B - For each £, let Xk/ = \Pk,e(B)\, and let 
Yfc £ be the maximum number of perfect matchings between two subsets of the vertex 
classes of B, each of size n — kt. Let £q = rs/^rc logn. Since £q > y/log n we may 
take £q to be an integer. Note that 

(5) k£ = =^ = o(n). 



We first derive a bound on Y^g, for £ < £q. Consider any 5cy and T C V', each 
of size n - k£. We claim that e B \v\S,V'\T) < Wr G k 2 £l/n. Indeed, let S' C V and 
T C V' be sets of size k£ such that V\S C S' and V'\T C T. Then 

«(V\5, V\T) < e B(S >, T') < = 12^. 

In n 

Moreover, \S\ = \T\ = n — o{n) by ([5]) and so es{S,T) > r B n/2. Hence by Lemma 

Y ke < A n ~ 2kl >B + Wr G k 2 £ 2 /n \ n - M c _ n+M Wn/rn ^ 
' ~ \ n — k£ J 

We can further estimate this as follows: Observe that 

(n - 2k£)r B + 10r G k 2 £?,/n ( k£ 10r G k 2 £ 2 

~ r B 1 m + 



n — k£ \ n — k£ nrs(n — k£) 



< rB exp 



k£ 10r G k 2 ° 2 



n — k£ nrs(n — k£) 
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and hence 

'10r G k 2 £?, 



Y k/ <r n B - ki e~ ki exv 



e - n+kl (3n) 10n / r ' B 



(6) = r"- M e 10fe e- n (3n) 10n / rs , 

where the last line follows from that fact that 



r G kt\ ||p r G nVlog n r B 



1. 



nrB nrB \fr~G V r G log n 

Now we proceed to bound X k /. Note that if M\ and M2 are matchings such that 
D(M\) and D^M^j are cycles, then M\ and M2 are vertex-disjoint if and only if D(M\) 

and D(M2) are vertex-disjoint. We claim that there are at most Cu)r B ^ f-~ k ways 
of choosing k vertex-disjoint matchings M of size I in B, such that D{M) is a cycle 
of length I for each M. Indeed, we can construct these matchings as follows: First 
choose an initial vertex in V for each matching ((?) choices). For an initial vertex 
of M, select an unused neighbour v[ 2 6 V of % (at most re choices) and add v^v'^ 
to M. Then select an unused neighbour v' ia of Vi 2 (recall that Vi 2 is the vertex of V 
corresponding to v - 2 ), etc., until we have selected v\ . Now if v\ is a neighbour of v% t 
then add u^w^ to M and note that D(M) is a cycle of length I. Repeat this process 
for each initial vertex, in each case using only vertices we have not used before. This 
process described above constructs every collection of k vertex-disjoint matchings of 
size £ at least l k times, since we choose the initial vertex of each matching arbitrarily. 
The claim follows immediately. 

Since Y^g bounds the number of perfect matchings on the remaining vertices (i.e., 
those which are not contained in the above matchings), we have 

X k/ <(^rf-'k- k Y k ,. 
Estimating (™) < {^f) k and n\ > (f) n , we have 

lyl- 1 f (^) fc r^ 1 )r fc rr^ 1 ° fe e- n (3n) 10 ^ f^-Y (£ 

' ' * r k B £ - k r k r n B - M (3n) Wn / rB rz n = ( (3n) 10n / r * 



Hence 



ne 

— J ■ B - ■ B >-•< ■ U \]. frjJ 

(3n) 10n / rB . 

11 v log n/ 10 \ 10n/r B £ Q 



!log n 




e 



log n 



v log n/ 10 \ ^n/rs £o 

) 3n £r* 

7 / 1=2, 



But ((e 11 /logn) 1 °g n / 10 3n) 10n / rfl < 2~ 10n / rs < 1 and ^ fc < 3~ lo s n < 1/n for * > 3. 
Hence 

X- 1 ^ X M < V" < V" < 1- 

£=3 £=3 

Note that since fc^o < C/2 and n/lo = C/2, we have C > Mo + w/^o- But any 
1-regular digraph which, for all I < £q, contains fewer than k cycles of length I has 
fewer than kl$ + n/£o cycles in total. Hence \Vc(B)\ < Y^iL^-^-ki < X. □ 
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The following lemma and corollary accomplish our aim of decomposing an even- 
regular graph H which is close to being pseudorandom into 2-factors with few cycles 
in total. 

Lemma 26. Let tq > log 2 n, and let G be an rc/n-pseudorandom graph on n vertices. 
Let th > 2*/tg logn be even and let H be an rn-regular spanning subgraph of G. Then 
H contains a 2-factor with at most C = AnyJrQ Iogn/rjj cycles. 

Proof. Recall that Petersen's theorem [28] states that every even-regular graph can 
be decomposed into 2-factors. Therefore H can be decomposed into a collection 
Fx, . . . ,F rg of 2-factors, where rg = th/2. For each of these 2-factors we orient 
the edges so that each cycle is an oriented cycle, thus forming a collection of 1-regular 
digraphs on V{H). Now taking the union of these digraphs yields an orientation D of 
H which is re-regular. 

Label the vertices of V{H) as v\, . . . , v n . Form a bipartite graph B on vertex classes 
V = {vi, . . . ,v n } and V = {v[, . . . ,v' n } by joining V{ € V to v'j £ V if and only 
if there is an edge of D from v% to Vj. Now for any sets S C V and T C V' with 
\S\, \T\ > n/y/r~G, let S", T' be the corresponding subsets of V(H). Then 

e B (S, T) < e H (S' U T') < e G (S' U T') < r ^\±^l + 2^{\S\ + \T\) 

3r G (|g| + |r|) 2 
In 

where the third inequality follows since G is (rc/n, 2y / rYJ)-jumbled by Definition [7(a), 
and the fourth since 2 < y/ra(\S\ + \T\)/n. Now Lemma [2H implies that \Vc{B)\ < 
(— ) n!. But by Theorem 1231 the total number of perfect matchings of B is at least 
(— ) n!. So there exists a perfect matching P of B such that D(P) has at most C 
cycles. Now ignoring the orientation of D(P) yields the desired 2-factor of H. □ 



For a 2-regular graph F, let c{F) be the number of cycles of F, and for a collection 
T of 2-factors let c(J r ) = ^i? e jrc(F). Repeated application of Lemma [26] gives us the 
following result. 

Corollary 27. Let G be an re / n-pseudorandom graph on n vertices with re > log 2 n. 
Let H be an rn -regular spanning subgraph of G, such that m is even. Then H can 
be decomposed into a collection of 2-factors T = {Fi, . . . , F rH /2\, such that c(J-) < 

3ny re log 3 n. 

Proof. If rji < 8y / rc;logn then the result follows from Petersen's theorem [2S], noting 
that trivially c(F{) < n/3 for each i. So suppose rn > 8y / rclogn. Let iq = rjj/2 — 
i/rclogn and note that since iq > rji/4 > log 2 n, we may assume that i$ is an integer. 
By repeatedly applying Lemma [26] we have 

x - Any re logn v - , v - 1 

c{T) < ) — — + y n/3 < 2n^/r G logn > -+nVr^logn 

i f— ' r_ff/2 — ? + 1 . z 

i=l i=io+l i=^/rQ\ogn 

< 2n^J log n(log(r#/2) + y^logn) < 3n-\/ rc log 3 n. 

□ 
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Now we consider an arbitrary even-regular graph H' and aim to show that if we 
take an even-regular graph H on V{H') which is close to being pseudorandom and 
edge-disjoint from H' , then we can decompose H U H ' into 2-factors with a similar 
bound on the total number of cycles as in Corollary [271 Our strategy will be to first 
split H 1 up into matchings. We then extend each of these matchings into two perfect 
matchings using edges of H (see Lemma [28]) . and apply Lemma [23] to transform each 
of these perfect matchings into a 2-factor with few cycles (see Lemma 129]) . 

We start by considering the case of a single matching. If n is odd, then call a graph 
F on n vertices a pseudomatching if it has a unique vertex of degree 2 and all other 
vertices are of degree 1 . A perfect pseudomatching in a graph G is a pseudomatching 
covering every vertex of G. 

Lemma 28. Let G be an rein-pseudorandom graph on n vertices with rQ > log 2 n 
and let H be a spanning subgraph of G such that 6(H) > ||§?*g + 5. Let M be a 
matching with V(M) C V{G) and suppose M and H are edge- disjoint. Then 

(i) if n is even, then there exists a partition of M into submatchings M\, M2, 
such that each of Mi,M% can be completed to a perfect matching on V{G) 
using disjoint sets of edges of H , and 

(ii) if n is odd, then there exists a partition of M into submatchings M\ , M2 , such 
that each of M±,M2 can be completed to a perfect pseudomatching on V(G) 
using disjoint sets of edges of H . 

It would be more convenient to extend M directly into a perfect matching rather 
than splitting it first, but this does not seem to be possible. 

Proof. Let rjj = §§§ r G- We partition M randomly by placing each edge into either 
Mi or M 2 , with equal probability. For a vertex v 6 V(G), let Y v be the set of edges 
ww' of M such that both w and w' lie in Nh{v), and Z v the set such that exactly one 
of w and w' lies in N H (v). Note that 2\Y V \ + \Z V \ < d H (v) < A(G) < 2r H , and that 

|JVjr(t;)\y(Mi)| ~ 2Bin(\Y v \, 1/2) + Bin(\Z v \, 1/2) + \N H (v)\V(M)\. 

(Here we use that M and H are edge-disjoint and so the unique neighbour of v in M 
is not a neighbour of v in H.) 

Suppose \Y V \ > 81ogn. Let e = 4ylogn/|l^| < 3/2 and note that 2\Jrn logn > 
e I Y v \/2, since ru > \Yv\- By Lemma E^ii) we have that 

P \Bin(\Y±\/2) < \YJ/2 - 2^/r H logn\ < 2 e - £2|n|/6 < 2e - 161ogn/6 < \. 

On the other hand, if \Y V \ < 8 logn then |l^|/2 < 2i/th log n and so 

P Bin(\Y v \, 1/2) < \Y v \/2-2y/r H \ogn =0. 

By the same argument we can show that 

1 



P \Bin(\Z v \, 1/2) < \Z v \/2 - 4^r H logn 
So whp, every v £ V(G) satisfies 



< T 



\N H (v)\V(M 1 )\ > \Y V \ + \Z v \/2 - S^r H logn + \N H {v)\V{M)\ 



= -\N H (v) n V(M)\ + \N H (v)\V(M)\ - S^logn > r H /3 + 7, 

and hence 5(H — V{M\)) > rjj/3 + 7. By a similar argument the same holds for 
M%. Now choose a partition of M into M\ and M2 such that the above bounds on 
|iV // (v)\V r (Mi)| and \N H (v)\V(M 2 )\ hold for all vertices v £ V(G). 
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If n is odd, then we first find a path v\V2V% of length 2 in H — V(M\) and add it to 
M\ to form a pseudomatching, so that V(H)\V(M\) has an even number of vertices. 
We also remove v\V2 and V2V% from H. (If n is even then we simply omit this step.) 
Henceforth we proceed identically in cases (i) and (ii). 

Let W = V(H)\V(M{) and note that we still have 5(H) >r H + 3 and 5(H[W]) > 
r ///3+4. We now use Theorem [19] to show that i?[W] contains a perfect matching. For 
this it suffices to prove that for any set S C W, the number of components of ff[W / \5'] 
which have an odd number of vertices is at most \S\. If S is nonempty then this follows 
from Lemma [20] On the other hand if S = then by Lemma [20] H[W\S] = H[W] 
has exactly one component, namely W. But this component has an even number of 
vertices, and so by Theoremll9lff [W] contains a perfect matching M[. So Mi U M[ is 
a perfect matching or pseudomatching on V(G). We delete the edges in M[ from H. 

We now repeat the process for M2. First we extend M2 to a pseudomatching (if n 
is odd). Now we still have 5(H) > rjj and S(H — V(M%)) > rjj/3, so the conditions of 
Lemma [20] still hold and hence we can complete M2 to a perfect matching or pseudo- 
matching on V(G). □ 



Lemma 29. Let G be an rc/n-pseudorandom graph on n vertices with rQ > 21og 2 n ; 
and let H be a spanning subgraph of G such that 5(H) > §g?~G- Let P be a perfect 
matching (ifn is even) or a perfect pseudomatching (ifn is odd) on V(G), edge-disjoint 
from H . Then H contains a matching P' such that P U P' forms a 2-regular graph 
with at most 3nV / ( ^' ) ° g " cycles. 

We will apply Lemma [29] in the proof of Lemma [30] with P being one of the perfect 
matchings Mi obtained from Lemma |2"81 

Proof. Let m = 5(H) and note that r# > log 2 n by Definition E^c) . If n is odd, then 
let v\ be the vertex of P of degree 2 and ^2,^3 its neighbours. For the remainder of 
the proof we treat ^2^3 as if it were an edge of P, and V1V2, V1V3 as if they were not 
edges of P. If t>2t>3 is also an edge of H, then we delete it from H (so that P and H 
remain edge-disjoint). The proof then proceeds identically whether n is odd or even. 

Label the edges of P as e\, e2, • • • , e [n/2\ ■ F° r each edge e«, label one of the vertices 
dj (chosen at random and independently from the other choices) and the other 6j. Let 
B\ = {ai, . . . , a^ n / 2 \ } and B2 = {b\, . . . , &[n/2j }> an d let B be the bipartite graph on 
vertex classes B\,B2 with edges Eh(Bi, B%). 

Now for each v G V(B), let Y v be the set of edges ab G P such that both a and b lie 
in Nh(v), and let Z v be the set of edges ab G P such that exactly one of a and b lies 
in N H (v). Note that 

d H (v) - 1 < 2|y„| + \Z V \ < d H (v) < A(H) < A(G) <r G + 2Vr G logn < 

where the second-last inequality follows from Definition He). Now ds(v) ~ \Y V \ + 
Bin(\Z v \, 1/2). (Here we use the condition that P and H are edge-disjoint and so the 
unique neighbour of v in P is not a neighbour of v in H .) 

Suppose \Z V \ > 81ogn. Let e = 4ydogn/|Z^| < 3/2 and note that ru log n > 
e\Z v \/2. By Lemma [HJ^ii) we have that 

P \Bin(\Z v \, 1/2) < \Z v \/2 - A^fr H lognj < e ^ 2 l^l/ 6 < 2e - 161o s n / 6 < -L 

On the other hand if \Z V \ < 81ogn then \Z v \/2 < 4\/ log n and so 

// = 0. 



n 2 



Sm(|Z„|, 1/2) < \Z v \/2 - 4Vr H log 
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So whp every v G V(B) satisfies d B (v) > \Y V \ + \Z v \/2 — log n > rjf/2 — 

5i/th log n. Similarly, whp every t> G V(l?) satisfies d B (v) < llr#/20 + 5^r^ log n. 
Now we choose S such that <5(.B) > r#/2 — 5\/rff logn and A (5) < llr///20 + 
h\frn log n. Let = 2rff/5 and note that re > rcr/3. 

Claim: contains a regular spanning subgraph B' of degree r B - 

To prove the claim, we use the Max-flow Min-cut theorem. Let each edge of B have 
capacity 1. Add a source a joined to each vertex of B\ by an edge of capacity re and 
a sink r joined to each vertex of B 2 by an edge of capacity r^. Let n' = \B\\ = [n/2\ . 
Now we show that the minimum cut must have capacity at least n'rg (indeed exactly 
n'rBi since one could cut all of the edges incident to a). It follows that the maximum 
flow from a to r is n'rs- Further there is some flow which achieves this maximum and 
such that the flow along each edge is an integer. Taking the edges of B which have 
flow 1 immediately yields the desired r^-regular spanning subgraph. 

So consider a cut C. Let T be the set of vertices v G B\ such that av is not part 
of C. Similarly let S be the set of vertices v G B2 such that vt is not part of C. 
Now since every edge in Eb(T,S) must be part of C, the capacity of C is at least 
es{T, S) +rB(2n' — \S\ — |T|). So noting that ee(T, S) = e#(T, S), it suffices to prove 
that 

e H (T, S) > n'r B - r B (2n' - \S\ - \T\) = r B (\S\ + \T\ - n'). 

Let S' = B2\S. Then an equivalent statement is that 

e H (T, B 2 ) - e H (T, S') + r B (\S'\ - \T\) > 0. 

We can assume without loss of generality that \T\ + < n' < n/2; otherwise 
we can rearrange the inequality as en{B\,S) — ch{T',S) + 7-#(|T"| — \S\) > where 
T' = B\\T and the proof proceeds analogously. We now consider the following cases: 

Case 1: \T\ > 2\S'\. We can estimate e H (T,S') < |5'|A(5) and e H (T,B 2 ) > \T\S(B). 
Now 

e H (T, B 2 ) - e H (T, S') + r B (\S'\ - \T\) > \T\(5(B) - r B ) - \S'\(A(B) - r B ), 

and the right-hand side is positive since 5(B) — rs > — 5\/ r B log n > rn/12 and 

A(B) -r B < 3r H /20 + 5y/r H log n < r H /6. 

Case 2: \S'\ > \T\. Then the inequality holds trivially. 

Case 3: \S'\ < \T\ < 2\S'\. Note that \T\ < n/3 (otherwise \T\ + \S'\ > n/2). So since 
G is (ra/n, 2 ^/r^) -jumbled by Definition Efc), (JT[) implies that 

r B \S'\ - e H (T, S') > r B \S'\ - r G \T\\S'\/n - V^(|T| + \S'\) 

> \S'\(r B - r G \T\/n - 120^) > |5"|(r G /3 - r G /3 - 12^) 

= -12\S'\^> -12\T\y/¥d. 

Hence e H (T, B 2 ) - e H (T, S') +r B (\S'\ - \T\) > \T\5(B) -r B \T\ - 12\T\y/fa > 0, which 
completes the proof of our claim. 

Now Theorem [23] implies that the number of perfect matchings of B' is at least 
X = ( T -$-) n ' (n')\. Note that for any S Q B\ and T C B 2 with \S\,\T\ > n'/y/rd, 

m 

e B ,(S,T)<e G (S,T) < r G \S\\T\/n + A^(\S\ + \T\) 
<r G \S\\T\/n' + 2r G (\S\ + \T\) 2 /n' 
<br G (\S\ + \T\f/2n\ 
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where in the third inequality we use that 2 < ^/tg(\S\ + \T\)/n' and in the last 
inequality we use that 2|5||T| < (|S'| + |T|) 2 . Set C = 2n' V r G log ri//rB- Now applying 
Lemma [241 with V = B\. V' = B<i, and Vi = a« and v\ = hi for each 1 < i < re' implies 
that \Vc(B')\ < X. So there exists a perfect matching P' in B' such that D(P') has 
at most C cycles. (Recall that D{P') was defined in the paragraph before Lemma [241) 
But we have a one-to-one correspondence between cycles . of D(P') and 
cycles b^a^b^a^ . . . b^a^b^ of PL) P'. Hence PU P' has at most C cycles. Now note 
that C < 3n \J tq log n/r# . Finally, if re is odd then on the cycle of P U P' containing 
the edge ^2^3, we replace 1^3 by the path V2V\v^, so that PU P' is 2-regular. □ 

We now combine Lemmas [28] and [29] to show that under suitable conditions, the 
union of an arbitrary even-regular graph H' and a graph H which is close to being 
pseudorandom contains a collection of edge-disjoint 2-factors which together cover the 
edges of H' . 

Lemma 30. Let G be an rc/n-pseudorandom graph on n vertices with rQ > 21og 2 n, 
and let H be a spanning subgraph of G such that 5(H) > (1 — l/HOO)?^. Let H' be 
an arbitrary r h 1 -regular graph on the same vertex set, and let E^ a d C E(H'). Suppose 
that H' is edge-disjoint from H and that rjji + 1 + 10 6 |£^ a <i|/n < rc/5000. Then there 
exists m < rc/5000 and a collection F\, . . . ,i*2m of edge-disjoint 2-factors in H U H' 
whose union covers all of the edges of H' , such that each Fi has at most 4n\/log n / \fr~Q 
cycles and each set E(Fi) n is a matching of size at most ra/10 6 . 

We will apply Lemma [3D] with E^ being a set of 'bad' edges which we will want to 
avoid when merging the cycles of each Fi into a Hamilton cycle. The purpose of the 
restrictions on F^ n E^ is to spread the edges in E^ out among the i^'s and thus 
make them easier to avoid. 

Proof. Let rn = (1 — l/550)rG + 5. By Vizing's theorem, we can decompose E(H') into 
edge-disjoint matchings M[,M' 2 , . . . ,M' r For each i, split M- into [W 6 \E(Ml) n 
Ebad\/n + lj matchings Mj such that \E(Mj) D E bad \ < n/10 6 . Let Mi, . . . , M m be the 
resulting collection of matchings and note that m < r^r + 1 + W e \Eb a d\/ n < tg/5000. 
Now for each matching Mj, we will find a pair of edge-disjoint 2-factors in H U Mj, 
each with at most Analog n / ^/rc cycles, which together cover the edges of Mj. Thus 
the total number of 2-factors will be 2m. 

Firstly observe that for any v G V(H), no more than 4m — r^i < rc/1200 edges of H 
incident to v will be used during this process to construct our 2-factors (as each 2-factor 
uses up at most 2 edges of H incident to v ). Hence after deleting all the edges of H lying 
in the 2-factors found so far, we still have 5(H) > (1 - 1/1100 - l/1200)r G > r H . For 
each Mj, we use Lemma[28lto decompose Mj into two matchings and complete each one 
to a perfect matching or pseudomatching using edges of H. Now by Lemma I2U1 we can 
complete each of these perfect matchings or pseudomatchings to a 2-factor using edges 
of H, such that each 2-factor produced has at most 3n^rc log njru < in^logn/ y/rc 
cycles. □ 

Finally we combine Corollory [27] and Lemma [30j to fully decompose H U H' into 
2-factors with few cycles in total. 

Corollary 31. Let G be an r q j n-pseudorandom graph with rQ > 2 log 2 re and let H 
be an even-regular spanning subgraph of G of degree rn, with 5(G) — 1 < r h < 5(G). 
Let H' be an arbitrary even-regular graph of degree r^i on the same vertex set and let 
Ebad ^ E(H'). Suppose that H' is edge- disjoint from H and that r^' + l-|-10 6 |£';, af j|/re < 
rc/5000. Let m = (r}j + rj//)/2. Then there exists a decomposition T = {Fi, . . . , F m } 
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of HUH' into 2- factors such that c(F) < 4n\/ tq log 3 n and E(Fi)nEf )a( j / is a matching 
of size at most n/10 6 for each i. 

Proof. Apply Lemma 1301 to construct a collection F' of at most rc/2500 edge-disjoint 
2-factors in H U H' , such that |J F' covers H', 

rx ./x . 4nVlog nr G , 

C{F) ~ 2500^ ^^clogn, 



and the set E(F) n Eb a d is a matching of size at most n/10 6 for each F G J 7 '. Let 
If" = (HL)H')\ (J J 7 ' and note that i?" is an even-regular spanning subgraph of G. So 
by Corollary [27] with H = H" we can decompose H" into a collection F" of 2-factors 
such that c(F") < 3n\/ log 3 n. Taking F = F' UF" , we have 0(7-") < 4n\/ r G log 3 n. 

□ 



7. Merging cycles 

So far we have the necessary tools to find large collections of disjoint 2-factors 
in a pseudorandom graph G, by first finding a regular spanning subgraph and then 
decomposing this subgraph into a collection F of 2-factors. The aim of this section 
is to show that we can transform these 2-factors into Hamilton cycles, by 'merging' 
the cycles of each 2-factor together using edges which are taken from a pseudorandom 
graph G' . The crucial observation here is that G' need not be as dense as G originally 
was; in fact, under certain conditions G' may be taken to be significantly sparser. To 
establish this we make use of the bounds on c(F) proved in Section [6l 

Recall that N(S) denotes the external neighbourhood of S, i.e. N(S) = Uses N(s)\S. 
All paths P are considered to have a 'direction' in the sense that the first and last end- 
points are distinguished (so if P = v\ . . . Vj> and P' = V£ . . . v\ then we view P and 
P' as different paths) . If P = x . . . y then we call x the first endpoint and y the last 
endpoint of P. We define the reverse of a path P = x . . . y to be the path P' = y . . . x 
which has the same vertices and edges as P. 

We will use the rotation-extension method. For this we need the following defini- 
tions. 

Definition 32. Let P = v\V2 ■ ■ ■ vg be a path. A rotation of P with pivot v i is the 
operation of deleting the edge Vi-\Vi from P and adding the edge ViV\ to form a new 
path v; L -iVi-2 ■ ■ ■ v\ViVi + \ . . . V£ with endpoints vi-\ and V£. Call Vi-iVi the broken edge 
of the rotation, and ViV\ the new edge. 

Definition 33. Let P be a path and H a graph with V(P) C V(H), such that P and 
H are edge- disjoint. Let Q C V(H) and let r > 1 be an integer. Then a vertex v of 
P is (H, Q, r) -reachable if there exists a sequence of at most r rotations which make v 
into the first endpoint of P, with all of the new edges being edges of H and all of the 
pivots being elements ofQ. 

The main reason we include the set Q in this definition is that there will be certain 
edges of P that we do not want to break. We achieve this by ensuring that none of 
the endpoints of these edges lie in Q. Let 

(7) t = - — : + 3. 

log log n 

In what follows, the sequences of rotations we consider will generally have length at 
most tq. 
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Definition 34. Let P = V\V2 ■ ■ ■ V£ be a path, and let C = W\W2 ■ ■ ■ w m w\ be a cycle 
which is vertex-disjoint from P. An extension of P to incorporate C , with join vertex 
Wi and broken edge Wi-\Wi is the operation of deleting the edge Wi-\Wi from C and 
adding the edge v\Wi to form a new path wi-\Wi-2 ■ ■ ■ Wi + \WiV\ . . . v^x v i- Call v\Wi 
the new edge of the extension. 

Given a 2-regular graph F and an edge-disjoint graph H on the same vertex set, 
we say that a Hamilton cycle C is formed by merging the cycles of F using edges of 
H if E(C) C E(F) U E{H). Our strategy will be to first merge two of the cycles of 
F together to form a long path P. We then show that if Q C V{H) is 'large', the set 
of (H, Q, To)-reachable vertices of P must in general be large (see Corollary !4Up . This 
will allow us to either extend the path by incorporating another cycle, or close P to a 
cycle, thus reducing the number of cycles in F. Repeating this process will eventually 
produce a Hamilton cycle. 

The following simple and well known lemma (see e.g., [20} Lemma 23], [221 Propo- 
sition 3.1] for similar versions) will give us many reachable vertices provided that H 
'expands', i.e., that Nh(S) n Q is large (compared to S) for any S C Q which is not 
itself too large. We include a proof here for completeness. 

Lemma 35. Let H be a graph on n vertices, and let P = x . . . y be a path on a subset 
of V(H) which is edge-disjoint from H . Let Q C V{P), and let U T be the set of 
(H,Q,t) -reachable vertices of P. Then |?7 T +i| > ^\Njj(U t ) n Q\ — \U T \. 

Proof. For a vertex v G P, let v~ and v + be the predecessor and successor, respectively, 
of v along P. Let T = {v £ Nh(U t ) n Q \ v ~ , v + ^ U T }. If v E T, then since neither v 
nor either of its neighbours on P are in U T , the neighbours of v are preserved by every 
sequence of at most r rotations of P with pivots in Q; i.e., v + and v~ are adjacent to 
v along any path obtained from P by at most r rotations with pivots in Q. It follows 
that one of v~ and v + must be in U T +i. Indeed, starting from P, we can obtain by 
performing at most r rotations a path with endpoints z and y, such that z £ U T and 
zv is an edge of H. Now by one further rotation with pivot v and broken edge either 
vv + or vv~, we obtain a path whose endpoints are either v + ,y or v~ ,y. 

Now let T + = {v + | v G T,v + G U T+1 } and T~ = {v~ \ v G T,v~ G U T+1 }. It 
follows from the above that either |T + | > |T|/2 or |T~| > |T|/2, and both T + and T~ 
are subsets of U T+1 . Hence \U T+1 \ > \T\/2 > (\N H {U T ) DQ\- 2\U T \)/2. □ 



The fact that we have the required expansion property will follow from Lemma [ 
Lemma [38] relies on Corollary [371 which in turn relies on Lemma [36l Lemma [36] is a 
simple consequence of pseudorandomness. 

Lemma 36. Let G be a p-pseudorandom graph on n vertices, and let S,TC V(G) be 
disjoint with s = \S\ and t = \T\. Let 

'2(s + t)logn if l ^>h 
7s(s + t)p otherwise, 



2s\ogn if l ^>h 
7s 2 p otherwise. 



9(s,t) = 

and 

h(s) 

Then ea(S,T) < g(s,t) and ec(S) < h(s). 

Proof. Suppose first that > X. Then Definition [JJh)(i) implies that ec{S,T) < 



sp 

2(s + t)logn = g(s,t). If ^ < \ and (i + \) ^ > \ then Definition E[b)(i) implies 
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that e G (S,T) < 2(s + t)logn < 7s(s + t)p = g(s,t). Finally if (± + ±) ^£5 < 2 then 
Definition [7^b)(ii) implies that e G (S,T) < 7stp < 7s(s + t)p = g(s,t). 

For the second part, if | < < |, then by Definition [T](b) (iii) we have e G (S) < 

2s log n < 7s 2 p = h(s), and otherwise the result follows immediately from Definition 
EDO (iii) and (iv). □ 

Corollary 37. Let G be a p-pseudorandom graph on n vertices. Let S C V(G) with 
\S\ = s, and for each vertex x G S, let A x C Ng{x). Let T = \J xGS A X \S, and t = \T\. 
Then the following properties hold: 

(i) Ifs< 2 -^ andZ x zs 14=1 >12 S logn ; fat>^. 

(ii) Ifs> 2 -^,thent + 3s>^^. 
Proof, (i) By Lemma 1361 we have 

J2\A X \ < e G (S,T) + 2e G (S) < g(s,t)+2h(s) = 2(s + t) logn + 4s log n. 

Hence t > (X) xeS \ A *\ ~ 6s log n)/2 log > Y. xe s |4s|/41ogra. 
(ii) We have 

14,1 < e G (S, T) + 2e G (S) < g(s, t) + 2h(s) = 7sp(s + t) + Us 2 p 

xeS 

and the result follows immediately. □ 

Lemma 38. Let < e < 1 be a constant. Let G' be an r G i jn-pseudorandom graph on 
n vertices and let G be an r G jn-pseudorandom spanning subgraph of G' , with r G < r G i 
and er G > 16 log 2 n. Let H be a spanning subgraph of G such that 



(8) \E(G)\E(H)\ < 2n y / r G logn. 
Let H' be a spanning subgraph of G' , such that 

(9) \E(H)\E(H')\ < and \E(H')\E(H)\ < ^L. 

W^r G ' W^r G ' 

Let Q' , S C V{G) and suppose that |A^/(x) n Q'\ > er G for every vertex x G S. Then 
at least one of the following holds: 

(i) l\N w (S)nQ>\-\S\> \S\ log n, 

(ii) \S\ < ggjj and \\N H ,{S) n Q'\ - \S\ > 

(iii) \S\ < en/90 and %\N H ,(S) D Q'\ - \S\ > en/45, 

(iv) |5| > \Q'\/6, 

(v) ±\N H ,(S) nQ'\- \S\ >\Q'\/6. 

Note that if r G is much smaller than r G i then ([9]) is more restrictive than simply 
requiring the symmetric difference of E{H) and E(H') to be o{e{H)). We need this 
more restrictive bound in Cases 3 and 4 below to obtain good expansion for small sets. 
(See also the remark after Corollary 1441) 

Proof. Let T = N W (S) HQ', s = \S\ and t = \T\. For each vertex x G V(G), let 
B x = Ntf/ (x) n Q' and note that 

(10) Y \B X \ > er G s > 16s log 2 n. 

x£S 

We consider the following cases: 
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Case 1: s < 2 ^ n . In this case we apply Corollary [57[i) with G = G' and A x = B x 
to obtain t > ^If^y > = 4s log n. Hence (i) holds. 

Case 2: < s < f^. Apply Corollary E^ii) with G = G' and A x = B x to 

obtain 

Hence t > and so \t - s > i.e., (ii) holds. 

Case 3: fg- < s < Let A x = B x n iVff(z) and note that 

> er GS >er G |^ > 8\E(H')\E(H)\. 

Hence Z^slM ^ E x es\ B *\ ~ 2 \ E ( H ')\ E ( H )\ > lExesW > 12s log 2 n by ((TO]). 
So Corollary l3"7T i) with G = G implies that t > — 1 |^ n * > 3s log n, and hence (i) 
holds. 

Case 4: max{f^, < s < en/90. Let A x = B x n iV H (a:), and note similarly 

to Case 3 that Ylxes l-^l — I ^xeS \Bx\- Hence Corollary l37T ii) with G = G implies 
that 

t ] 3s > nZ xeS \A x \ ^ nJ2 x€S \B x \ gj3 en 
7sr G ~ 10sr G ~~ 10 

So t > en/15, and t/2 — s > en/45. Hence (iii) holds. 

Case 5: s > en/90. We may assume without loss of generality that s < |Q'|/6 
(otherwise (iv) holds). In this case we must have \Q'\ > en/15. 

Claim: \N w (S)nQ'\ >2\Q'\/3. 

Let Q" = Q'\(Nh'(S) U 5). To prove the claim, suppose for a contradiction that 
\Q'\N H ,(S)\ > \Q'\/3. Then 

\Q"\ > |Q'|/3 - |5| > IQ'l/6 > a > en/90. 
Now since ejj'(S, Q") = 0, we have 

e G (S,Q") < \E(G)\E(H')\ < \E(G)\E(H)\ + \E(H)\E(H')\ 

(SJ,® , — e 2 nr 2 r e 2 nr G 

(11) < 2nJr G log nH t-^- < -. 

v ; _ v c- b -r lo4r ^ _ gooo 

But on the other hand Definition Eta) and ([TJ) imply that 

e G (S, Q ) > 4^(s + \Q |) > - 4n^G 7 > 

contradicting (mj. This proves the claim. Now ±\N H >(S)nQ'\-\S\ > \Q'\/3-\Q'\/6 = 
IQ'l/6 and so (v) holds. " □ 
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For a graph H and a set S C V(H), let Intji(S) be the set of vertices x G V^(-ff) 
such that x G 5 and JVjj-(s) C S. If 5 is a set of vertices such that 5 ^ then 
we take Int H {S) to mean Int H (S n F(JET)). Further let C7#(£) = SU iV# (S). Note 
that V(H)\IntH(S) = Cl(V(H)\S), and that if P is a path then it is possible for an 
endpoint x of P to be in Intp(S). 

Roughly speaking, the following lemma states that rotations (and extensions) of a 
path P do not have much effect on Intp(S), provided that the pivots (or join vertices) 
themselves lie in Intp(S). 

Lemma 39. Let P = x . . . y be a path, and let Q be a set of vertices. 

(i) Let z G Lntp(Q). Suppose we perform a rotation of P with pivot z, and let P' 
be the resulting path. Then Intp(Q)\{z} C Lntp'(Q) C Lntp(Q). Further, if 
x G Q then Lntp/(Q) = Lntp(Q). 

(ii) Let C be a cycle vertex- disjoint from P. Let zz~ be an edge of C and suppose 
that z G IntdQ). Suppose we perform an extension of P to incorporate C 
with join vertex z and broken edge zz~ , and let P' be the resulting path. Then 
(Lntp(Q)U Lnt c {Q))\{z} C Lnt P ,{Q) C Int P (Q)U Lnt c (Q). Further, if x G Q 
then Lnt pi (Q) = Lnt P (Q) U Lnt c (Q). 

Proof, (i) Let z~ be the predecessor of z along P, z the predecessor of z~, and x + 
the successor of x. The only vertices whose neighbourhoods change as a result of the 
rotation are x, z and z~ . However, if x G Lntp(Q) then x + G Q and thus x G Lntp>{Q) 
and vice versa (here we use that z G Q). Similarly, z~ G Lntp{Q) and z~~ G Intp>(Q) 
each hold if and only if z G Q. Since z G Intp(Q), the first part of (i) follows. If 
x G Q then we have z G Lntp/(Q) and hence Lntp/(Q) = Lntp(Q). 

(ii) The proof of (ii) is similar. □ 

The following corollary gives, under fairly weak conditions, a lower bound on the 
number of (H' , Q, ro)-reachable vertices of a long path P. This allows us to make any 
one of a large number of vertices of P into the first endpoint of P via a short sequence 
of rotations. Further, it allows us to 'avoid' a specified set (namely V(P)\Q) while 
doing so. We will use this second property for two main purposes: Firstly in order to 
make sure that certain edges of our 2-factor F which we want to keep are not broken 
during the process of transforming F into a Hamilton cycle, and secondly when we 
want to prevent one half of P from being affected by the rotations at all. In each of 
these cases we need to construct sets satisfying the conditions on Q and Q' in Corollary 
[40l Lemmas 01] and |42] accomplish this. 

Corollary 40. Let < e < 1 be a constant. Let G' be an rQi jn-pseudorandom 
graph on n vertices and let G be an rQ jn-pseudorandom spanning subgraph of G' , with 
r G < r G' an d £r G ^ 16 log 2 n. Let H be a spanning subgraph of G such that (E|) holds, 
and let H' be a spanning subgraph of G' such that (0|) holds. 

Let P = x . . .y be a path in G' such that P and H' are edge-disjoint. Let Q C V(P) 
with x G Q and let Q' C V(G') be such that Q' n V(P) C Lntp(Q). Suppose that 
\Nh'(v) D Q'\ > era for every vertex v G Q. 

Then either 

(i) there exists a {H' , Q' , to) -reachable vertex v of P which has a neighbour in H' 
lying in Q'\V(P), or 

(ii) the set of (H' ,Q' , To) -reachable vertices of P has size at least \Q'\/6. 

Proof. Suppose that (i) does not hold, i.e., that Njp(v) n Q' C V(P) for every 
(H' , Q' , To)-reachable vertex v of P. For each t < to, let U T be the set of (H',Q',t)- 
reachable vertices of P. Assume for contradiction that (ii) does not hold either, i.e., 
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that \U T \ < \Q'\/6 for all r < r . By Lemma [35] with Q = Q' n V(P) we have 
that \U T+1 \ > \\N H '(U T ) n Q'\ - \U T \ for each r < r , since N H >(U T ) flQ'n F(P) = 
N HI (U T )nQ f . 

Note that any rotation of P whose pivot lies in Q' f] V(P) C Intp{Q) produces a 
path P' whose first endpoint lies in Q. Thus Intp/(Q) = Lntp(Q) by Lemma [397i) and 
the same holds for all paths obtained by further rotations with pivots in Q' (~)V(P). So 
U T C Q and hence |iVfr'(v) Pi Q'\ > ere for each v £ U T . Now Lemma [38] with S = U T 
implies that |C/ T +i| > min{ |g^ G , || , | U T | log n} for any t < tq. (Here we use that 
conclusions (iv) and (v) of Lemma [381 cannot hold since \U T \ < \Q'\/6 for all r < tq.) 
Hence there must exist some 



l°g(min{i^,g}) 



n < v = \ ^ + 1 < T - l, 

log log re 

such that |[/ T1 |>min{f^,fg}. 

Suppose first that § > Then |J7 n | > f^-, and hence Lemma[38]with S = U T 

implies that |J7 r +i| > m i n {||) \ U T \ logn} for each r such that t\ < t < tq. Hence there 
exists T2 < to such that | U T2 \ > || . But now setting S = U T2 , none of the conclusions 
of Lemma [381 can hold. So we obtain the desired contradiction. 

On the other hand, if || < lg£° then we already have \U T1 \ > ||. So we derive a 
contradiction in a similar way. □ 

The following lemma will be used to obtain a set of vertices in which the expansion 
property we require still holds, and which also contains no endpoints of any 'bad' edges 
that we want to avoid while rotating. (W will be the set of these endpoints.) 

Lemma 41. Let G' be an tqi /n-pseudorandom graph on n vertices and let G be an 
tq /n-pseudorandom spanning subgraph of G' , with 300 log 3 re < tq < tqi. Let H be an 
even-regular spanning subgraph of G with degree rjj such that 5(G) — 1 < r# < 5(G). 
Let H' be an rn-regular spanning subgraph of G' , such that 

9 

77 r 

\E(H)\E(H')\ < £-=-, 

2500r G / log 2 re 

and let F be a 2- factor of G' which is edge- disjoint from H' . Let W C V(G) with 
\W\ < re/400. 

Then there exist sets V C V(G) and V" C V'\W, such that 

• \Lnt F (V")\ >n-6\W\, 

• \Lnt F (V')\ > n-\W\/ log 2 n, and 

• \N H '(v) n Lnt F (V")\ > r H /2 for every v £ V . 

Proof. Note that by Definition 0(c) , 

5(G) >r G - 2Jr G logn > r G ( 1 == | > 290 log 3 re, 

and hence r^j > 288 log 3 re. 

If W = then we can take V" = V' = V(G), so we assume hereafter that \W\ > 1. 
We define V" as follows: Initially V" = V(G)\W. As long as there exists a vertex 
v £ V" such that \N H >(v) n Jn<ir(V")| < r J H'/ 2 ) w e remove u from V" . Let TV' be 
the set of removed vertices and note that \N}ji(v) n Cl F (W U W')| > rjf/2 for every 
v G TV'. Suppose that this process continues until \W'\ = \ W\. Then for each x £ W' , 
let B x = N H '(x) n Cl F (W U W). Thus we have \B X \ > r H /2 for each x £ W. 
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Note that 

(12) \ B *\ ^ \W\r H /2 > 144 1 W' | log 3 n, 

xew 

and also that \Cl F (WUW')\ < 3(\W\ + \W'\) = 6\W\. Now we separate into two cases: 

Case 1: \W'\ = \W\ < nr H /150r G ,. Applying Corollary [37] with G = G' , S = W and 
A- x = B x , we obtain either 



\Cl F (WUW')\ > 



U ft 



or 



\Cl F (WUW')\ > 



U ft 



xdW 



> 



>^>7|iy| 

81ogn 



Ur G >\W\ 



3\W\ > 7\W\, 



either of which yields an immediate contradiction. 

Case 2: \W'\ = \W\ > nr H /150r G >. Let A x = B x n N H (x). Note that 



xdW 



> 



> 



>4\E(H)\E(H')\, 



2 150r G , 

and that 

E 1^1 > E " 2 \ E ( H )\ E ( H ')\ ^\Y1 l^'l ^ 72|W|log 3 n. 

So we can apply Corollary [37] with G = G and S" = W' to obtain either 

\W\r H 

^ x dL - 

xew 



\Cl F (WUW')\ > 



U *. 



16 log n 



> 7\W\ 



or 



\Cl F {WUW')\ > 



U 

x&W 



> 



n 



either of which again yields an immediate contradiction. 

So the process must terminate before \ W'\ = \W\. Fix V" in its state at the point 
when the process terminates. Note that \V(G)\V"\ < 2\W\ and hence \Cl F (V(G)\V")\ < 
6\W\, i.e., \Int F (V")\ > n - 6\W\. 

Let W" = {v G V(G) | |iV ff /(u) n Cij?(W U W')\ > r H /2} and V = V(G)\W". 
Since W" C WU W we have | W"\ < 2\W\. We will show that V" and V satisfy the 
assertions of the lemma. Since W" <^W'UW = V(G)\V", we have V" C and 
so it remains to prove that |JniF(V')| >n — \ W\/ log 2 n. This holds provided that 

(13) |CZ(tOl < \W\/log 2 n. 

We now claim that \Cl F (WUW')\ > 18\W"\log 2 n. A gain we consider two cases: 

Case 1: < nr H /300r G/ log 2 n. Then applying Corollary E3 with G = G',S = W" 

and A x = B x , we have either 

\Cl F (WUW')\ > Ex f^" lBxl > > 18|^"|log 2 n 

1 v n ~ 41ogn ~ 81ogn ~ 1 1 B 



or 



\W"\nru 

\Cl F (WUW')\ > -L — L-£L -3|W"| > |W"|(191og 2 n-3) > 18|^"|log 2 n, 
14rc \W \ 
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as desired. 

Case 2: \W"\ > nr H /300r G , log 2 n. Note that 

D lBxl - T • snn 2 ^ 4|£(tf)VE(tf')l, 
^rr*„ 2 300r(7/ log n 

and so setting = Nh(x) D .Be again we have 

^ 1^1 ^ E -2|S(^)\^(i2"')l > 5 E 1^1 ^ 72|W"|log 2 n. 
Now again applying Corollary 1371 we have either 

\Cl F (WUW')\ > > 18|W'|log 2 n 

16 log n 

or 

|C7 F (WU W')| > ' „..., - 3\W"\ >—- 3\W"\ > 13\W\ - 6\W\ > 7\W\. 
28rQ\W | 30 

But the latter case cannot occur since |C7.f(WlW)| < 3|WUW'| < 6|W|. This proves 



< - < '^fffir^ < |W|/31og 2 nandso |C/ F (^")| < \W\/log 2 ;,, 

which proves (TL3j) . □ 



Roughly speaking, the following lemma states that if we have a graph H' which is 
close to being pseudorandom and a long path P subdivided into log n segments, then 
most of the vertices will have many neighbours in most of the segments. This will 
enable us to carry out rotations involving only the initial half of a long path in the 
proof of Lemma H3l The proof of Lemma 02] is similar to that of [20} Lemma 22] . 

Lemma 42. Let G be an tq / 'n-pseudorandom graph with r G > 10 5 log 2 n. Let e < 1/5 
be a positive constant and let n' be an integer such that n/10 < n' < n. Let U C V(G) 
be such that \V(G)\U\ < e 2 n'/8, and let H' be a graph on V(G) such that 

(14) \E(G)\Em < §22. 

Let P be a path which is edge- disjoint from H' , with V(P) C V(G) and \P\ = 
n' , divided into logn segments J\ </iogra whose lengths are as equal as possible. 
Then there exists a set L C [logn] such that \L\ > (1 — e)logn and for every i G /, 
there exists J- C Jj n U such that \Intp(J-)\ > (1 — e)n' / logn, which satisfies the 
following condition: For every v £ JL and for all but at most elogn indices j € I, 

|JV H /(v)n/ntp(^)l>2sfe- 

Proof. We define / and {J- \ i £ 1} as follows: Initially I = [logn] and J- = JiCiU. For 
a vertex v and for j E I, call v weakly connected to Jj if \Nh'(v) r\Intp(Jj)\ < 2 5iogn " 
As long as there exists i£l and a vertex »£ J?, such that v is weakly connected to 
J 1 - for more than elogn values of j G we remove u from J 4 '. Further, if at any stage 
there exists i £ I such that \Lntp(J-)\ < (1 — e)n'/logn, then we remove i from /. 

We claim that this process must terminate before e 2 n'/4 — |V(P)\i7| vertices are 
removed. Indeed, suppose we have removed e 2 n'/4— \V(P)\U\ vertices and let R be the 
set of removed vertices. Note that no vertices of V(P)\U are included in R, and that 
\R\ > e 2 n'/8 since \V(P)\U\ < \V(G)\U\ < eV/8. Now \RU(V(P)\U)\ = eV/4, 
and so Y^=i \Int P (J()\ > (1 - 3e 2 /4)n'. Hence we have \Int P (J()\ > (1 - e)n'/logn 
for at least (1 — 3e /4) log n values of i, i.e. , at most 3e log n /4 indices have been removed 
from /. So for each vertex v € R, there remain more than elogn/4 indices i E I for 



32 



FIACHRA KNOX, DANIELA KUHN AND DERYK OSTHUS 



which v is weakly connected to J[. For each i £ I, let WC(i) be the set of vertices 
v G R which are weakly connected to J-. Let J = {i G I \ \WC(i)\ > eV/64}. 

Claim: For each i G Iq, there are at least ^Qiogn° e( ^9 es * n E(G)\E(H') between 
Int P (J() and WC{i). 

Let S = Intp(J-). To prove the claim, note that S n WC{i) = since WC(i) C i?, 
and hence (P) implies that 

e G (S, WC(i)) > |g||TyCW|rG - Vr£(|S| + |WC(i)|) 

n log n 

> |^C(i)|r G / _ 250 log n \ > 3|WC(z)|r G 



50 log n \ y^G / 50 log n 

where the last inequality follows from the condition r G > 10 5 log 2 n. But the definition 

WC(i)\r c 
25 log n 



of WC(z) implies that en'{S, WC(i)) < ^Fff^ , and the claim follows immediately. 



Observe that \WC > |i?|elogn/4 > e 3 n' logra/32, since each v G R is 

weakly connected to J- for more than elogn/4 values of i G /. But now 

E i^wi < ia/.i^ < 

and hence ^j g / |WC(i)| > e 3 n'logn/64. Together with the claim, this implies that 

\E(G)\E(H')\ > Y \WC(i)\ > g3rGn i /1 ° gn > 

1 v M v ;| -501ogn^' Wl 3200 logn " 32000 

which contradicts (|14p . This proves that the process must terminate before e 2 n'/4 — 
|V(P)\i7| vertices are removed. Now |/| > (1 — e)logn at the point at which the 
process terminates, since as before at most 3elogn/4 indices have been removed, and 
so at this point I and { J[ \ i G /} satisfy the assertions of the lemma. □ 

In the next lemma we exhibit a method for merging cycles of a 2-factor F together. 
The basic idea is fairly simple: given F, we first use rotation-extension to transform 
some of the cycles in F into a long path. When we can no longer extend the path, 
we apply Lemma 32] to show that there is a large set Q\ on the first half of the path 
whose vertices are reachable (by rotations which only use vertices in the first half of 
the path as pivots), and a similar set Q2 in the last half of the path. Since Q\ and 
Q2 are large we can find an edge between them and close the path to a cycle, thus 
forming a new 2-factor F' with fewer cycles. However, the fact that we need to avoid 
bad edges forces us to be very careful. 

Lemma 43. Let G' be an r G i / n-pseudorandom graph on n vertices, and let G be an 
rc/n-pseudorandom spanning subgraph of G' with 300 log 3 n < r G < tqi. Let F be a 
2-factor of G' and let H be an even-regular spanning subgraph of G of degree r# such 
that 5(G) — 1 < r# < 5(G). Let H' be an rjj-regular spanning subgraph of G' , such 
that 

(15) \E(H)\E(H')\ = \E(H')\E(H)\< 



2500r G , log z n 
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Let Eb a( i C E(F) be a matching of size at most n/220000. Suppose that F and H' are 



,.2 



edge-disjoint and that 2\Ef ja d\/ log 2 n < c(F) < - 4 "^ a - , 

Then unless F is a Hamilton cycle, we can obtain a new 2-factor F' with the fol- 
lowing properties: 

• E(F') C E(F) U E(H'), 

• c(F') < c(F), 

. \E(F') n E(H')\ < £^ic(F) - c(F')), and 

• E bad CE(F'). 

Proof Let W be the set of endpoints of edges in E bad . Note that \W\ < re/1 10000 
and so by Lemma HQ there exist sets V C V(G) and V" C F'\W, with |Irat F (y")| > 
n — 6|W| and \Intp(V')\ >n — |V7|/log 2 re and such that 

(16) \N H ,(v) n Int F {V")\ > r H /2. 
for every vertex v E V. Note that 

(17) c(F) >2\E bad \/ log 2 n = \W\/ log 2 n>n-\Int F (V')\. 

We will obtain F by deleting an edge yy' in a cycle of F to form a path P, performing 
a sequence of rotations and extensions to incorporate other cycles of F into P, and 
finally closing P to a cycle, thus reducing the number of cycles. The new edges for 
these rotations and extensions will be taken from a graph H" , which is defined as 
follows: Initially H" = H' U {yy'}- Whenever a rotation or extension with new edge 
e G H" is performed, we remove e from 77" and call e a used edge. For each edge of H" 
used we will add back to H" the edge of F broken during the corresponding rotation 
or extension. 

We can view H" as the 'current' version of H'; note that we always have \E(H")\ = 
\E{H')\ or \E(H")\ = \E(H')\ + 1. Similarly, let F' be the current version of F (so F' 
is either a 2-factor of H'UF, or consists of a path and various cycles). Note that since 
we do not change F itself during the proof, the function IntF will not change either. 

As long as we use on average at most 5 log n/ log log n edges per cycle of F merged, 
we have 

\E{H")\E(H)\ < \E{H)\E(H")\ + 1 

< \E(H)\E(H')\ + \E(H')\E(H")\ + 1 
( lg ) ^ nr G + 5c(F) log re + 1 < 2nr 2 G 



2500r G / log n log log n r G < log re 

This fulfils condition Q of Lemmal38land Corollary 1401 whenever they are applied with 
H' = H" . We also have that e(G) < nr^/2 + 2n- s fr~Q since G is (r^/n, 2y / r75)-jumbled 
by Definition [3(a) , and 

e(-ff) = -y > g — - n V^Glogn- - 

by Definition E[c). So since S(iJ) C E'(G), 



(19) \E(G)\E(H)\ <2n^ + n V / r G logn + -<2n V / r G logn. 
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This fulfils condition ([8]) of Lemma [38] (and Corollary I40p . Further, still assuming that 
on average at most 5 log nj log log n edges are used per cycle merged, we have 

\E(G)\E(H")\ < \E(G)\E(H)\ + \E(H)\E(H")\ 



, . mm 2nr 2 r /l\ 3 r G n 

20 < 2nVr G logn+ < ( — )-£—. 

re log n \ 15/ 32000 



Claim: H'[Intp(V')] is connected. 

To prove the claim, suppose for contradiction that H'[Intp(V')] has two components 
S and T. Since (HU) implies that \N H <(v)nInt F (V')\ > r H /2 for any v £ Int F (V), we 
can apply Lemma [38] with e = 1/3, Q' = Int F (V) and S = S. But N H >(S) D Q' = 0, 
and so of the possible conclusions of Lemma [35] only (iv) can hold. This implies that 
\S\ > \Q'\/6 > n/7, and similarly |T| > |Q'|/6 > n/7. Now since G is (r G /n,2y/fg)- 
jumbled, ([I]) implies that 

e G (S,T) > r G \S\\T\/n - tyr5(\S\ + \T\) > nr G /50. 

But by ([20]) this implies that eu'(S, T) > 0. Hence if'[Intj?(y)] has only one compo- 
nent, which proves the claim. 

Our first aim is to find a path P\ = x" . . . y" whose vertices span at least two cycles 
of F and so that x" and y" lie in Intp(y"\ Let C\ be a cycle of F containing a 
vertex of Int F (V'). It is easy to see that IntF(V')\V (C\) is nonempty. Indeed, if 
V = V(G) then it is sufficient to recall that F is not a Hamilton cycle. Otherwise 
\Intp(V')\ < n — 3 and so by (|17p the number of vertices which are not part of C\ 
is at least 3(c(F) - 1) > 3(n - |Jntir(OI - 1) > n - |/nt F (y')l- Hence one of these 
vertices must be in Intp(V). (This is the only place in the proof where we use the 
lower bound on c(F).) 

Since H'[Intp(V')] is connected, there exists an edge xy of H' with x G IntpiV') n 
F(Ci) and y £ /nt F (y)\^(Ci)- Let C 2 be the cycle of F in which y lies. Since E^^ 
is a matching, there will be at least one edge yy' ^ E^ in C2 incident to y. Delete 
yy' from C 2 to form a path C' 2 = y...y'. Note that Int F (V") n V(C^) C Int c ^(V"). 

Again since -E&ad is a matching we can find xx' ^ -Efoad i n Ci incident to x. Perform 
an extension of C' 2 to incorporate C\, with join vertex x and broken edge xx' . Let 
Pq = x' . . . y' denote the resulting path and note that {IntFiV") n V(Po))\{x, y} C 
/nt Po (F"). 

Now by (116j) . a;' has a neighbour zi in iT", which lies in IntF(V")\{x,y}. If z\ € 

V(Po) then we perform a rotation with pivot Z\\ note that the first endpoint x" of 

the resulting path lies in V" . Otherwise, let C be the cycle containing z\ and let 

z\x" ^ Eb a d be an edge of C. Perform an extension of Po to incorporate C and again 

note that the new first endpoint x" will lie in V" . Call the resulting path Pq and let 

Pq = y' ...x" be the reverse of Pq. (Recall that the reverse of a path was defined 

in the second paragraph of Section [7]) Now similarly we can perform a rotation (or 

extension) of Pq with some pivot z 2 (or join vertex z 2 ), so that the new first endpoint 

y" also lies in V" . Call the resulting path Pq" and let Pi = x" . . . y" be the reverse of 

tjiii 

M) • 

Applying Lemma [39] twice with P = Pq and Pq respectively, Q = V" and z = z\ 
and z 2 respectively implies that 

(21) (Int F (V")nV(P 1 ))\{z 1 ,z 2 ,x,y} C Int Pl (V"). 

All rotations performed during the remainder of the proof will have pivots in (IntF(V")D 
V(Pi))\{zi,z 2 ,x,y} (where P\ is the current path). Thus by (J2T|) the pivots lie in 
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Intp 1 (V"). Also the join vertices of all extensions performed during the remainder of 
the proof will lie in Intp(V"). Hence no edges in will be broken. Moreover, the 
endpoints of Pi will always lie in V" and hence Lemma [39] implies that (1211) will always 
hold. 

Note that for each v £ V , we have 

\N H „(v)nInt F (V")\ > \N H ,(v)nInt F {V")\-2 

since H' C H" U F' and A(F') = 2. So (USD implies that 

\N H ,,(v) n Int F (V")\ > r G /3 + 4 

with room to spare. We now incorporate additional cycles into Pi by an iterative 
procedure. Each iteration proceeds as follows: 

Apply Corollary M with e = 1/3, H' = H" , P = P x , Q = V" n V(Pl) and Q' = 
Intp(V")\{zi, Z2, x, y}. We obtain one of two possible conclusions: 

Case 1: There exists an (H" ,Q' ,tq) -reachable vertex v of Pi which has a neighbour 
z G Int F (V")\V (Pi) . Let C be the cycle of F on which z lies. Now we perform the 
necessary rotations to make v the first endpoint of Pi, and then an extension with join 
vertex z to incorporate C into Pi. We then redefine Pi to be the resulting path and 
begin the next iteration. 

Case 2: The number of (H" , Q' , To)-reachable vertices of Pi is at least \Q'\/d> > 3n/20. 
Note that this immediately implies that |Pi| > |Q'/6| > n/10. Let Pi be divided into 
logn segments Jj whose lengths are as equal as possible. Noting that \V(G)\Q'\ < 
6\W\ + 4 < n/18000, we may apply Lemma [H with e = 1/15, H' = H", U = Q' and 
n' = \Pi\- Note that (fT4"|) is satisfied due to (f20j) . Thus we obtain a set / C [logn] of 
size at least 141ogn/15 and sets J[ C JiC\Q' for each i £ I, such that |Intp 1 ( J-)\ > 
14|Pi |/15 logn and the following holds: For every v G J t ', and for all but at most 
logn/15 indices j G /, \N H »(v) n /nt Pl (Jj)| > 25^. 

Note that | IJie/ Intp x {J[)\ > (1 — 2/15) | Pi | . Hence for some i £ I, J[ contains a 
(H" , Q' , ro)-reachable vertex of Pi. Perform the necessary rotations to make one of 
these vertices into the first endpoint, breaking at most tq edges in the process. Redefine 
Pi to be the resulting path, and let P{ be the reverse of Pi. Now apply Corollary |4"U1 
with e = 1/3, H' = H" , P = P{, Q = V" n V(P[) and Q' = Int F (V")\{zi, z 2 , x,y}. 
Again we obtain one of two possible conclusions: 

Case 2a: There exists an (H" ,Q' , To) -reachable vertex v of P[ which has a neighbour 
z G Int F (V")\V(P{). In this case we extend P[ as in Case 1, redefine Pi to be the 
resulting path and begin the next iteration. Note that in future instances of Case 2, 
the sets /, Jj and J[ will be redefined for each new path P[. 

Case 2b: The number of (H" ,Q' , To) -reachable vertices of P[ is at least \Q'\/6 > 
3n/20. Note that any instance of Case 2b is immediately preceded by an instance of 
Case 2, and so the segments Jj have been defined (possibly redefined) so that they 
partition V(P[). Call a segment Jj broken if one of its edges was used as a broken 
edge in one of the rotations in Case 2 (or, for later uses, in Case 2b), and let /' C I be 
the set of all indices i £ I such that Jj is an unbroken segment. Note that since each 
rotation breaks at most one segment, | \J ieI , Intp^(J!-)\ > (1 — 2/15 — ro/logn)|P{| > 
(1 — 3/20)|P{|. Hence for some i £ I', J[ contains a (P", Q' , ro)-reachable vertex of P[. 
Perform the necessary rotations to make one of these vertices into the first endpoint, 
breaking at most tq edges in the process. Call the resulting path P2 = X2 ■ ■ ■ y%- 
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Eventually, the iterative procedure has to conclude by entering Case 2b. Let I" C I 
be the set of all indices i E 7 such that Jj was broken in neither Case 2 nor Case 2b. 
Let 1%, I 2 be subsets of I" such that = \I 2 \ = and for every i\ 6 I\, i 2 G I 2 

the segment precedes J[ where the ordering is taken along P 2 . Now 

|/i|,|/ 2 | > -1/15) log n-2r ) > ^(1 - 1/10) log n. 

Let Qx = {x 2 } U Uig/j J'i, Qi = fe} U U iG / 2 4 and note that 

(22) l/ntftCQaJI, \Intp 2 (Q 2 )\ > I (l - 1 Vl - 1) |P X | > > ^. 

Further, for each vertex u of Qi, and for at least |/i| — logn/15 > logn/3 indices 
i E 1%, we have that |iV#//(u) n/ntp 2 (J|)| > 2 5\og n ( n °ti n g that Intp 2 (Jl) = Intp 1 (J-) 
since J- is unbroken). Hence 

INH^nlnt^Qi)] >r G /75 

for all v £ Qi, and the corresponding statement holds for vertices of Q 2 - 

Now applying Corollary 00] with P = P 2 ,Q = Q 1} Q> = Intp 2 (Qi), H' = H" and e = 
1/75 implies that there exists a set A of |/ntp 2 ((5i)|/6 (iT 7 ', Intp 2 (Q\), ro)-reachable 
vertices of P 2 (since Q' C V(P 2 ), it is impossible for the first conclusion of Corollary HU1 
to hold). Let P' 2 be the reverse of P 2 and apply Corollary [40l again with P = P 2 , Q = Q 2 
and Q' = Intp^(Q 2 ) to obtain a set B of |/ntp/((52)|/6 (H" , Int P ^(Q 2 ), ro)-reachable 
vertices of P 2 . Now > n/150 by (|22p . and so using ([1]) we have 

e G (A,B)> ] - 11 1 G -4^{\A\ + I B\) > — ^- - > 



n 22500 v ~ 30000 

Hence 

e H »U,B) > - \E(G)\E{H")\ 9 0. 

v 7 ~ 30000 ;| 

Now let £32/3 6 Ejj"{A, B). Noting that x% is (H", Intp 2 (Qi), T )-reachable, perform 
a set of at most tq rotations of P 2 with pivots in Intp 2 (Qi), to form a new path P 2 
whose first endpoint is x%. Note that Intp»{Q 2 ) = Intp 2 {Q 2 ). Thus setting P 2 to be 
the reverse of P 2 , we have that 2/3 is still (H" , Intp^i(Q 2 ), ro)-reachable. Perform a set 
of at most To rotations to make 1/3 the first endpoint of P 2 . Note that all the pivots 
of these rotations lie in Q\ U Q 2 C \J ieI J[ C Q', and so as discussed after (pT]) no edge 
in Sfead is broken. Now use X32/3 to close P 2 to a cycle. 

It remains to estimate the number of edges broken during the process. In the initial 
formation of P\ we break only 4 edges (two while forming Pq and one for each of the 
two subsequent rotations or extensions). During each instance of Case 1 or Case 2a 
we break at most tq + 1 edges, during Case 2 we break at most To edges, during Case 
2b we break at most tq edges, and during the remainder of the proof we break at most 
2to edges. Each time we merge a new cycle into P±, we need either to go through Case 
1 once, or through both Cases 2 and 2a once. Case 2b occurs only once (at the end, 
after Case 2). 

Let k\ be the number of times we repeat Case 1 and k 2 be the number of times 
we repeat Cases 2 and 2a. Then the number of cycles of F decreases by k\ + k 2 + 1. 
Further the total number of edges broken is at most 

4 + h(T + 1) + k 2 (2r + 1) + 4r < 4(&i + k 2 + l)r + 4 

So on average the number of edges broken per cycle merged is at most Atq + 4 < 
5 log nj log log n. □ 
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We can now apply Lemma H3l rep eat edlv to transform 2-factors into Hamilton cycles. 
Given a 2-regular graph F and an edge-disjoint graph H on the same vertex set, we 
say that a Hamilton cycle C is formed by merging the cycles of F using edges of H if 
E{C) C E(F) U E(H). E(H) n E(C) is the set of used edges, and E(F)\E(C) is the 
set of broken edges. 

Corollary 44. Let G' be an re /n-pseudorandom graph on n vertices, and let G be an 
rc/n-pseudorandom spanning subgraph of G' with 300 log 3 n <rQ< tqi. Let H be an 
even-regular spanning subgraph of G with degree rjj, such that 6(G) — 1 < ru < 5(G). 
Let J 7 be a collection of edge-disjoint 2-factors Fi,F%, . . . , F m of G' , such that each Fi 
is edge- disjoint from FL. 

Let Eb ac [ C E(\J F) be such that E^ n E(Fi) is a matching and \Ebad n E(Fi)\ < 
n/10 6 for each Fi. Suppose that c(J-)tqi log 3 n < 4nr G . Then we can merge the cycles 
of each Fi into a Hamilton cycle Cj using the edges of H. Further, we can ensure that 
the number of edges of E^ broken during the process is at most |-E& a d|/ log n, and that 
all of the C{ 's are pairwise edge- disjoint. 

When tq is much smaller than tg' the bound on c(F) is more restrictive than 
simply requiring that c(F) is small compared to e(H) as one might at first expect. 
The assumption is necessary due to (|15p which in turn arises from ([9]). In the proof 
of Lemma ST] (and thus of Theorem I48p this assumption will be the limiting factor in 
determining how small we can make rQ compared to tqi , and thus how many iterations 
we need to use. 

Proof. We merge cycles by repeatedly applying Lemma 03j During this process we 
will remove certain edges from FL (namely those which lie in the new 2-factor obtained 
by Lemma |43|) and add certain edges to H (namely those edges which are removed 
from the old 2-factor in Lemma|l3]to obtain the new one). Let FL' denote the 'current' 
version of FL (so FL always denotes the original version). 

We use Lemma H3] repeatedly to reduce c(i^) until c(Fi) = 1, i.e., Fi is a Hamilton 
cycle for each i. We make use of the fact that on average at most 5 log nj log log n 
edges of FL' are used by Lemma 03] for each cycle that needs to be merged. Hence we 
have 

\E(H)\E(H')\ = \E(H')\E(H)\ < 5c ^ lo ^ n < r G n 



log log n 2500r G ' log 2 n 

throughout the process. So (fT5|) is satisfied. 

More precisely, we proceed as follows: Given 1 < i < m, suppose that the number 
of cycles of Fi is at least 2|E^ a d n E(F{)\/ log 2 n. Then we can apply Lemma H31 with 
Fbad = Fb ai i n E(Fi) to merge one or more cycles of Fi together, without breaking 
any edges of E^ and using on average at most 5 log n j log log n edges of FL' per cycle 
merged. (Note that the required upper bound on c(Fi) holds since c(i^) < c(F).) 

On the other hand if the number of cycles is fewer than 2\Eb a d H E(Fi)\/ log 2 n, then 
we can apply Lemma H3] with E^ = repeatedly to merge all of the remaining cycles 
of Fi together, thus forming a Hamilton cycle. Again we use at most 5 log nj log log n 
edges of FL' per cycle and thus the total number of edges of E^ which are broken 
during this process is at most 

2\E had ^E{Fi)\ 51ogn < \E bad nE{F)\ 



log 2 n log log n log n 

The bound on the total number of edges of E bad broken follows immediately. □ 
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For a given application of Corollary HH during the remainder of the paper, define 
the leftover graph to be the final state of H' . Thus H' is obtained from H by deleting 
the edges of the Hamilton cycles produced by Corollary EH and adding those edges of 
U J- which do not lie in any of these Hamilton cycles. 

We now prove variants of Lemma [43] and Corollary I44L These will be used in cases 
where instead of needing to avoid a set of bad edges when rotating, we only want to 
avoid a vertex x$ (when we apply the lemmas xq will be the vertex of minimum degree 
in G n ,p)- For clarity we prove these as separate lemmas as the conditions are different, 
but the proofs proceed along similar lines in each case. 

Lemma 45. Let G' be an tq< In-pseudorandom graph on n vertices, and let G be an 
rc/n-pseudorandom spanning subgraph of G' with 10 5 log 2 n < rQ < tqi. Let xq be a 
vertex of G and let H be formed from G by removing all of the edges of G incident to 
xq. Let H' be a spanning subgraph of G' , such that \15}) holds and dn(v) = djj/(v) for 
each v G V(G). 

Let F be a 2-f actor of G' , edge- disjoint from H', such that c(F) < - — ^ 3 - . Then 

unless F is a Hamilton cycle, we can obtain a new 2-factor F' such that the following 
hold: 

• E(F') C E(F) U E(H'), 

• c(F') < c{F), and 

. \E{F>) n E{H')\ < £^(c(F) - c(F')). 

Proof The proof is similar to that of Lemma [431 We will again obtain F' from F by 
performing a sequence of rotations and extensions, with all the new edges for these 
rotations and extensions taken from a graph H" which is defined as in the proof of 
Lemma S3] Again, we say that an edge e of H" is used if it is the new edge of some 
rotation or extension performed during the proof. Define the current version F' of F 
as in the proof of Lemma 03] Observe that e(H) > e(G) — n, and so assuming we use 
on average at most ^, g °^ n edges for each cycle of F merged, the bounds ([T8]) . (fl~9]) and 
(1201) still hold. Moreover, 

5(H" - {x }) > 5{H' - {x }) = 5(H - {x }) 

(23) > 5(G) - 1 > r G - 2^r G logn - 1 > y- 

Claim: H' — {xq} is connected. 

To prove the claim, suppose for a contradiction that H' — {xq} has two components, 
S and T. By ([23]) we can apply Lemma 1381 with e = 1/3, Q' = V(G)\{xo} and S = S. 
But now N H /__{ Xo y(S) = 0, and so of the possible conclusions of Lemma [38] only (iv) 
can hold. This implies that | *S' | > (n — l)/6 > n/7. Similarly \T\ > (n — l)/6 > n/7. 
Now since G is (rc/n, 2 y/rc) -jumbled we have that 

e G (S,T) >r G \S\\T\/n-4yfF5(\S\ + \T\) >nr G /50. 

But by ([20]) this implies that ejj'{S,T) > 0, which contradicts our assumption that S 
and T were components. Hence H' — {xo} nas only one component, which proves the 
claim. 

Let C\ be a cycle of F . Since H' — {xo} is connected, there exists an edge xy of 
H' joining two distinct cycles C\ and C2 of F, with x on C\ and y on C2 and such 
that x,y / xo- Let yy' be an edge of C2 incident to y such that y' 7^ xq. Delete yy' 
from C2 to form a path C' 2 - Let xx' be an edge of C\ such that x' 7^ xq, and perform 
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an extension of C 2 with join vertex x and broken edge xx' to incorporate C\. Let 
P\ = x' . . . y' denote the resulting path. Note that xq cannot be an endpoint of Pi. 

Let V" = V{G)\{x } and Q' = Int F (V"). Note that Int Pl {V") = Q' D V(P%). All 
rotations performed during the remainder of the proof will have pivots in Intpiy") D 
V{P\), where Pi is the current path. Similarly, all extensions performed during the 
remainder of the proof will have join vertices in Q' . Thus by applying Lemma [39] with 
Q = V" , we always have that 

Int Pl (V") = Int F {V") n V(P]) = Q'n V{Pi). 

So in particular xq will never be an endpoint of Pi. 

By ([23]) we can apply Corollary M with e = 1/3, H' = H" , P = Pi, Q = V"r\V(Pi) 
and Q 1 = Q' and use the same case analysis as in Lemma H31 The remainder of the 
argument is also identical to that in the proof of Lemma US (Note that xq ^ J- for 
any i since J- C JiC\Q' , and hence xq ^ Q\ and xq ^ Q2O D 



Corollary 46. Let G' be an tqi /n-pseudorandom graph on n vertices, and let G be 
an rain-pseudorandom spanning subgraph of G' with 10 5 log 2 n <tq< r<ji . Let xq be 
a vertex of G and let H be formed from G by removing all of the edges of G incident 
to xq. Let J 7 be a collection of edge-disjoint 2-f actors F\,F2, . . . ,F m of G' , such that 

each Fi is edge-disjoint from H and c(P) < - 4 "^ G a ~ ■ Then we can merge the cycles of 
each Fi into a Hamilton cycle Cj using the edges of H , such that the Cj 's are pairwise 
edge-disjoint. (Recall that merging was defined in the paragraph before Corollary \44\) 

Proof. We merge cycles by repeatedly applying Lemma U5j During this process we 
will remove certain edges from H (namely those which lie in the new 2-factor obtained 
by Lemma |4"5|) and add certain edges to H (namely those edges which are removed 
from the old 2-factor in Lemma 05] to obtain the new one). Let H' denote the 'current' 
version of H (so H always denotes the original version). 

We use Lemma SS] repeatedly to reduce c(Pj) until c(Pj) = 1, i.e., P« is a Hamilton 
cycle for each i. We make use of the fact that on average at most 5 log nj log log n 
edges of FI' are used by Lemma 05] for each cycle that needs to be merged. So 

\E(H)\E(H')\ = \E(H')\E(H)\ < 5c ^ lo ^ n < ^-5-, 

loglogn 2500r G /log 2 n 

and hence (fT5j) is satisfied throughout the process. □ 



8. Completing the proof 

In this section we combine our results to prove Theorem [2j Roughly speaking, the 
following lemma states that given a graph Hq which is close to being pseudorandom 
and given about logn pseudorandom graphs Hi, ... , P~2 m +i, we can find a set of edge- 
disjoint Hamilton cycles in the union of these graphs which cover all of the edges of 
Hq. While the Hi cannot be too sparse, they need not be as dense as Hq. The point 
is that the remaining 'uncovered' graph is much sparser than Hq and is also not too 
far from being pseudorandom. In the proof of Theorem 1481 we apply this lemma three 
times in succession to obtain an uncovered graph which is very sparse. 

Lemma 47. Let pq > log 14 n/n, and let pi > ((npo) 3 log 10 n) 1 / 4 /^. Let m = ^fj^jp^ 
and let P2, ■ ■ ■ ,P2m+i be positive reals such that pi = p\ for odd i and pi = 10 10 pi 
for even i. Let Gq be a pQ-pseudorandom graph on n vertices. Suppose that Gi is 
a pi-pseudorandom spanning subgraph of Gq, and let Hi be an even-regular spanning 
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subgraph of Gi such that 5{Gi) — 1 < 5 {Hi) < 5(Gi), for each 1 < i < 2m + 1. Suppose 
that the graphs Gi are pairwise edge-disjoint for 1 < i < 2m + 1 and let Hq be an 
even-regular spanning subgraph of Go which is edge- disjoint from Ui™i +1 ^«- Then 
there exists a collection TIC of edge-disjoint Hamilton cycles such that Hq C [jTiC C 

Formally the assumption po > log 14 n/n can be omitted. It is included for clarity 
since if pq is significantly smaller then pq < Y^a=\ 1 Pi anc l so the lemma becomes 
vacuous. 

To prove Lemma [37J we first decompose Hq into 2-factors which on average have 
few cycles. We then use edges of H\ to transform these 2-factors into Hamilton cycles. 
Because edges are exchanged between H\ and the 2-factors there will still be some 
edges of Hq left uncovered (the 'bad' edges). We decompose H[ U H2 (where H[ is the 
leftover of Hi and Hq) into 2-factors with few cycles and then use H3 to transform them 
into Hamilton cycles (we cannot decompose H[ on its own since it is no longer close 
to being pseudorandom). Again some edges of Hq will be left uncovered, but we can 
guarantee that the number of such edges will be reduced (by a factor of about logn). 
After about logn/ log logn iterations we arrive at a leftover graph which contains no 
edges of Hq, i.e., all of the edges of Hq are covered. 

Proof of Lemma\42\ Note that pi > log 13 n/n. Corollary [27] with G = Gq and H = Hq 
implies that Hq can be decomposed into a collection T\ of 2-factors such that c{F\) < 
3ny/ npo log 3 n. Since 

(24) c(J"i) • np log 3 n < 4n(np log 3 n) 3/2 < An(npi) 2 , 

we may apply Corollary 03] with G' = Gq, G = G\, H = Hi, T = T\ and E^ a( i = 0. 
This allows us to merge the cycles of each 2-factor of J~i to form a collection of edge- 
disjoint Hamilton cycles. Let H[ C Hq U Hi be the leftover graph from Corollary 031 
(as defined after the proof of Corollary 1331) . and let E bad = E{H[) n E(Hq). Note that 
\E bad \ < \E(H[)\ = \E(Hi)\ < n 2 Pl /2. 

Now H[ is even-regular with degree at most npi, and npi + l + 10 6 n»i/2 < np2/5000. 
Hence we may apply Corollary [31] with G = G2, H = H2 and H' = H[ to obtain a 
decomposition T2 of H[ U H2 into 2-factors, such that c(J r 2) < 4ny np2 log 3 n and 
E(F) D -Ebad is a matching of size at most n/10 6 for each F £ J-2. 

By ([231) (noting that pi = ^3) we may apply Corollary [33J with G' = Gq, G = G%, 
H = H3, T = T2 and E bad = E bad . This yields another collection of edge-disjoint 
Hamilton cycles and a leftover graph H' 3 . Furthermore, if we redefine E^ad = E(H' 3 ) n 
E(Hq), then \Ef, a d\ is reduced by a factor of logn. 

We now repeat this process a further m — 1 times, using up the graphs H4, H$, . . . , 
H 2m+ i. Now we have \E(H 2m+1 )nE(H )\ < < 1, i.e., E(H' 2m+1 )nE(H ) = 0. 

Let HC be the union of all the collections of Hamilton cycles produced by this process 
and note that H CpCC \J^=o +1 H i- D 

Before proving Theorem [21 we state a 'pseudorandom' version of the theorem. The 
conditions in this version are significantly more complicated than those of Theorem [21 
however, it has the advantage of being entirely deterministic and we believe it to be 
of independent interest. 
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Theorem 48. Let log 50 n/n < po < 1 — n l / 4 log 9 n. Let 

(npo) 3 ^ 4 log 7//2 n (nj>2) 3 / 4 log 7 / 2 n 

P2 = , Ps = , 

n n 

(np 3 ) 3/4 log 7/2 n ^npoil-po) 

(25) P4 = , P5 ~ 



n ' n log 3 / 4 n 

and pi = po — p2 — P3 — Pa — P5 ■ For i = 2,3,4 let 

2\og(n 2 pi) 



log log n 



For l<j<2 mi + l let p (iJ) = (10 i U+ p J )m . +1 if j is odd and p (iJ) = pi^^ if j is 
even. 

Let Gq be a p^-pseudorandom graph on n vertices. Suppose that Gq has a decom- 
position into graphs Gi, G2, G3, G4, G5, and that Gi has a decomposition into graphs 
G(i,i)iG(i,2)i ■ • • ■> G(i,2m,i+i) f or * = 2, 3, 4, such that the following conditions hold: 

(i) Gi is pi-pseudorandom for each 1 < i < 5, 

(ii) Guj\ is puj)- pseudorandom for i = 2,3,4 and 1 < j < 2rrii + 1, 

(iii) Gi U Gi+i is (pi + Pi+i) -pseudorandom for i = 2,3, 4, and 

(iv) Go is Au- jumping where 

(26) n = 2np 5 = — . 

log J/ n 

Then Gq has property % . 

For the moment we will assume the truth of Theorem 08] and use it to prove Theo- 
rem [2j 

Proof of Theorem^ Let po = p and let Go ~ Gr np . Define as in the statement of 
Theorem [J8] the real numbers pi for 1 < i < 5, integers mi for z = 2,3, 4, and reals 
Puj) for i = 2, 3, 4 and 1 < j < 2mj + 1. 

Form graphs Gi,Gs and Guj\ for z = 2,3, 4 and 1 < j < 2mj + 1 as follows: For 
each edge e of Go, place e in G\ with probability pi/po, in G5 with probability p^/po, 
and in G(j j\ with probability Pajj/po for each z = 2,3,4 and 1 < j < 2rrii + 1. Let 

Gi = US +1 G (m) fo^ = 2, 3, 4.' 

Note that p« = o(po) for z = 2, 3, 4, 5 and that 



(27) nps > log 24 n, npi > log 14 n and npuj\ > log 13 

for z = 2,3, 4, where the second inequality holds since x > log 14 n implies that 
x 3//4 log 7 / 2 n > log 14 n. Thus the bounds on each pi and P(£,j) in Lemma [16] hold, 
and hence Lemma [16] implies that Go is po _ P seu dorandom and that conditions (i) , (ii) 
and (iii) of Theorem 1481 hold whp. Moreover, condition (iv) holds whp by Lemma [T7l 
Hence Theorem 1481 implies that Gq has property %. □ 
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It remains to prove Theorem! 

Proof of Theorem 48 Recall from the proof of Theorem [2] that pi = o(po) for i 
2,3,4,5. Thus 

(28) pi = (1 - o(l))po- 
Note also that (|27p holds and 

(29) np 4 = (np ) 27/M logi( 1+ ! + ^) < (np ) 27/64 log 49/6 n. 
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Let xq be the vertex of Go of minimum degree. If 5 (Go) is odd, then at this point 
we use Lemma [227i) to remove an optimal matching Mo p t which covers x$ from G±, 
and let G[ be the remainder. If 5(Gq) is even then let G[ = G\ and Mo P t = 0- 

Form from G5 by removing all of the edges incident to xq, and add the removed 
edges to G'i- For each Guj\, apply Lemma[22jii) with u = to form a regular spanning 
subgraph Hnj\ whose degree is either 8(Gnj\) (if 5(G^^) is even) or 5(G^j^) — 1 
(otherwise). If there are edges of G^^\H^^ which are incident to xq, move these 
edges into G[. Let Hi = (Jj=i +1 f° r * = 2,3,4. Now all edges of Go which are 

incident to xq lie in G[ U H 2 U H3 U H4 U Mopt, i.e., 

(30) d Go (x ) = d &i (x ) + (xq) + dtf 3 (zo) + d Hi (x ) + dAf 0pt (zo)- 
Let Y^ij denote the summation Ylt=2 Sj=i +1 - Note that (f30|) implies that 

(31) d Gl (x )-l < d G ,(x ) < 6(G )-2j2^(G(i !j ))m < 5(G )-^(G {hj) )-l). 

The next claim shows that the number of edges incident to xq which we added to G\ 
to form G' x is at most u. 

Claim 1: 

(32) A(G 5 ) + ( A (%i)) ~ + !)<«• 

hi 

Indeed, by Definition E[c) we have that 

A(G 5 ) < np 5 + 2Vnp 5 logn < — — = y. 

Further, note that 

(33) (5 log 4 n) 8 <npo(l-p ) 4 . 
Also, Definition E^c) implies that 



A ( G (2,j)) - S i G (2,j)) < 4 y n P(2,j) lo g n < W n P2 logn 

for each 1 < j < 2to2 + 1. Hence using m 2 = o(logn), we have 

2m 2 + l 

(A(G (2j) ) - S(G (2J) ) + 1) < 4(2m 2 + 1) (v> 2 log n + ^ 
j'=i 

< log n\Jnp2 logn = log 13//4 n(npo) 3//8 
" 51og 3 / 4 n 5 10' 

Similarly 

2m, + l 

£ (A(G (i>j) ) - + 1) < lognV^Pilogn < ^ 

for i = 3,4. Thus the left-hand side of (|32p is at most 2u/3 + 3(ii/10) < u, which 
proves the claim. 

Claim 2: Each x 7^ xq satisfies dc 1 (x) > (feiO^o) + 2u. 
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In other words, xq is the vertex of minimum degree in G\ and G\ is 2u-jumping. To 
prove Claim 2, recall our assumption that Go is 4n-jumping. Hence we have 



d Gl (x) > 6(G ) + 4n - A(G 5 ) - £ A(G (iJ) ) 



M ^ ED 

> 6{G )-^2(5(G {iJ) )-l) + 3u > d Gl (x )-l + 3u>d Gl (x ) + 2u, 



hi 



which proves the claim. 

Since each Hnj) is even-regular and since Mo p t covers xo if and only if dc (xo) is 
odd, ([30]) implies that d,Q> (xo) is even. Also recall that the number of edges added to 
Gi at the vertex xo (after removing Mo p t) to form G[ is at most the left-hand side of 
(f32]h and hence is at most u. So xo is the vertex of minimum degree in G\. Moreover, 
since 1 — po < 1 — pi , 



mm 2^n-2p l {l-p l ) 

U < - 3/4 <4V^1(1-Pl)- 

log ' n 

Hence we may apply Lemma [22] with G = G\ and G' = G\ to form a regular spanning 
subgraph Hi of G' t with degree 5(6?^) = d G i (xq). Note that Hi contains every edge of 
G'i incident to xq. 

By Lemma l47l where po = po, Pj = P(2,j) f° r 1 < i < 2?7i2 + 1, Go = Go, Hq = Hi, 
Gj = G( 2 j), and Hj = -£f(2,j) for 1 < j < 2m 2 + 1, there exists a collection HCi of 
edge-disjoint Hamilton cycles in Hi U H 2 which cover the edges of H\. Let H 2 be the 
graph formed by the edges of H 2 which are not contained in one of these Hamilton 
cycles. 

Applying Lemma [J7] again with po = P2 + P3, Pj = P(3,j) f° r 1 < i < 2m,3 + 1, 
Go = G 2 U G3, -Ho = H' 2 , Gj = G(3 j), and Hj = Hr Sf j\, we obtain a collection of 
edge-disjoint Hamilton cycles in H' 2 UH^ which cover H 2 . Let H' 3 be the graph formed 
by the edges of #3 which are not covered by one of the Hamilton cycles. 

Applying Lemma 07] again with po = P3 + Pi-, Pj = P(4,j) f° r 1 < j < 2ra^ + 1, 
Go = G3 U G4, Hq = H 3 , Gj = G( 4 j), and Hj = -ff(4j), we obtain a collection H.C3 of 
edge-disjoint Hamilton cycles in H' 3 L)H/i which cover H' 3 . Let H' 4 be the graph formed 
by the edges of H^ which are not covered by one of the Hamilton cycles. 

Note that H' 4 is a subgraph of G4. Hence by Corollary [27] we can find a decomposition 
J- of H' A into 2-factors such that c(J-) < 3n\/ np^ log 3 n. Now we claim that 

(34) 36(np 4 ) 3 log 9 n < (np 5 ) 4 . 

To prove ([54"]) . note that (nps) 4 = (npo(l— po)) 2 / log 3 n. So by ([2T?]) it suffices to prove 
that 36(np ) 81/64 log 67/2 n < (np (l -p )) 2 / log 3 n, or equivalently that (np ) 47/64 (l - 
Po) 2 > 36 log 73 / 2 n. But if p < 1/2 then we have 

(n Po ) 47/64 (l - Po) 2 > (np ) 47/64 /4 > (log 50 n) 47 / 64 /4 > 36 log 73 / 2 n, 

and if po > 1/2 then 

(np ) 47 / 64 (l - po) 2 > n 47 / 64 (n" 1 / 4 ) 2 /2 > 36 log 73 / 2 n, 



with room to spare, which proves ([34 

It follows immediately from (|34p that if ps < P4, then 



n(np^,) 2 n(np^ 2 



c{F) < 3ny np4 log n < 3 — < 



2np4log n n(p4+p5)log n 
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On the other hand if p$ >Pa, then 



c(J r ) <3ny np4 log n < 3ny np$ log n < 3 — < 



n(np 5 ) n(np 5 ) 5 



2 log 3 re ra(^4 +P5) log 3 n 

Hence in either case we can apply Corollary 06] with G' = G4U G$, G = G§, H = H5 
and J- = J- to obtain a collection HC4 of edge-disjoint Hamilton cycles in H'^UH^ which 
cover H' 4 . Now let UC = HC X U UC 2 U %C 3 U HC 4 . Observe that %C covers every edge 
of Hi, H2, H3 and H^. But recall that every edge of Go incident to xq is contained in 
either Hi, H2, H3, H4 or Mo P t- Hence HC contains exactly [_d,G (xo)/2\ = [5(Go)/2\ 
Hamilton cycles. □ 

Note that the only place where we use the full strength of the condition on pq is 
in the proof of f|33 [) and (|34p . Also note that if we omit one of the iterations (i.e., if 
instead of defining p±, G4 and H4 we simply use H§ to finish the decomposition of #3) 
then the proof of Theorem [2] still works as long as log 125 n/n < p < 1 — n -1 / 7 (say). On 
the other hand, we could have improved the lower bound on p in Theorem [2] somewhat 
by adding extra iterations. However, even a large number of iterations will only reduce 
the lower bound to approximately log 30 n/n. Some further small improvements could 
be made by using tighter calculations in some places. 
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